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ALU Arithmetic Logic Unit 
ATB Area Translation Buffer 
BIU Bus Interface Unit 
BTC Branch Target Cache 
CPU Central Processing Unit 
DC Data Cache 
DMMU Data MMU 
DTLB Data TLB 
DU Debug Unit 
EA Effective address 
FPU Floating-Point Unit 
GPR General-Purpose Register 
IC Instruction Cache 
IMMU Instruction MMU 
ITLB Instruction TLB 
MMU Memory Management Unit 
OR1K OpenRISC 1000 Architecture 
ORBIS OpenRISC Basic Instruction Set 
ORFPX OpenRISC Floating-Point eXtension 
ORVDX OpenRISC Vector/DSP eXtension 
PC Program Counter 
PCU Performance Counters Unit 
PIC Programmable Interrupt Controller 
PM Power Management 
PTE Page Table Entry 
R/W Read/Write 
RISC Reduced Instruction Set Computer 
SMP Symmetrical Multi-Processing 
SMT Simultaneous Multi-Threading 
SPR Special-Purpose Register 
SR Supervison Register 
TLB Translation Lookaside Buffer 
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1 About this Manual 


1.1 Introduction 


The OpenRISC 1000 system architecture manual defines the architecture for a 
family of open-source, synthesizable RISC microprocessor cores. The OpenRISC 1000 
architecture allows for a spectrum of chip and system implementations at a variety of 
price/performance points for a range of applications. It is a 32/64-bit load and store RISC 
architecture designed with emphasis on performance, simplicity, low power requirements, 
and scalability. The OpenRISC 1000 architecture targets medium and high performance 
networking and embedded computer environments. 


This manual covers the instruction set, register set, cache management and 
coherency, memory model, exception model, addressing modes, operands conventions, 
and the application binary interface (ABI). 


This manual does not specify implementation-specific details such as pipeline 
depth, cache organization, branch prediction, instruction timing, bus interface etc. 


1.2 Authors 


If you have contributed to this manual but your name isn't listed here, it is not 
meant as a slight — We simply don't know about it. Send an email to the maintainer(s), 
and we'll correct the situation. 








Name E-mail Contribution 
Damjan Lampret damjanl@opencores.org Initial document 








Chen-Min Chen jimmy@ee.nctu.edu.tw Some notes 








Marko Mlinar markom@opencores.org Fast context switches 


Johan Rydberg jrydberg@opencores.org ELF section 
Matan Ziv-Av matan@svgalib.org Several suggestions 















































Chris Ziomkowski chris@opencores.org Several suggestions 
Greg McGary reg@mcgary.or I.cmov, trap exception 
Bob Gardner Native Speaker Check 
Rohit Mathur rohitmathurs@opencores.org Technical review and 
corrections 

Maria Bolado mbolado@teisa.unican.es Technical review and 
corrections 

ORSoC yannv@opencores.org Technical review and 
Yann Vernier corrections 

Julius Baxter julius@opencores.org Architecture revision 
information 

















Stefan Wallentowitz stefan@wallentowitz.de Multicore and corrections 











Stafford Horne Multicore Contributions 











Stefan Kristiansson | stefan.kristiansson@saunalahii.fi | Atomic instructions 


shorne@gmail.com 














Table 1-1. Authors of this Manual 
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The revision history of this manual is presented in the table below. 
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Revision Modifications Arch. Ver 
Date (Maj.Min) — 
Doc Rev 
15/Mar/2000 | Damjan Lampret Initial document 0.0-0 
7/Apr/2001 Damjan Lampret First public release 0.0-1 
22/Apr/2001 | Damjan Lampret Incorporated changes from Johan and 0.0-2 
Matan 
16/May/2001 | Damjan Lampret Changed SR, Debug, Exceptions, TT, 0.0-3 
PM. Added I.cmov, |.ff1, etc. 
23/May/2001 | Damjan Lampret Added SR[SUMRA], configuration 0.0-4 
registerc etc. 
24/May/2001 | Damjan Lampret Changed virtually almost all chapters in 0.0-5 
some way — major change is addition of 
configuration registers. 
28/May/2001 | Damjan Lampret Changed addresses of some SPRs, 0.0-6 
removed group SPR group 11, added 
DCR[CT]=7. 
24/Jan/2002 Marko Mlinar Major check and update 0.0-7 
9/Apr/2002 Marko Mlinar PICPR register removed; I.sys 0.0-8 
convention added; mtspr/mfspr now use 
bitwise OR instead of sum 
28/July/2002 Jeanne First overall review & layout adjustment 0.0-9 
Wiegelmann 
20/Sep/2002 Rohit Mathur Second overall review 0.0-10 
12/Jan/2003 | Damjan Lampret Synchronization with orLksim and 0.0-11 
OR1200 RTL. Not all chapters have been 
checked. 
26/Jan/2003 | Damjan Lampret Synchronization with orLksim and 0.0-12 
OR1200 RTL. From this revision on the 
manual carries revision number 1.0 and 
parts of the architecture that are 
implemented in OR1200 will no longer 
change because OR1200 is being 
implemented in silicon. Major parts that 
are not implemented in OR1200 and 
could change in the future include 
ORFPX, ORVDX, PCU, fast context 
switching, and 64-bit extension. 
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Luis Vitério Cargnini. Fixed typos in 
various chapters reported by Matjaz 
Breskvar. Changed description of PICSR. 
Updated ABI chapter based on agreed 
ABI from the openrisc mailing list. 
Removed DMR1[ETE], clearly defined 
watchpoints&breakpoint, split long 
watchpoint chain into two, removed 
WP10 and removed DMR1[DXFW], 
updated DMR2. Fixed FP definition 
(added FP exception. FPCSR register). 
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15/Nov/2005 | Damjan Lampret Corrected description of I.sfXXui (arch 0.0-15 
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empty 32-bit implementation for 
lv.pack/unpack to prevent blank pages 
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Table 1-2. Revision History 


1.4 Work in Progress 


This document is work in progress. Anything in the manual could change until we 
have made our first silicon. The latest version is always available from OPENCORES 
revision control (Subversion as of this writing). See details about how to get it on 
Www.opencores.org. 


We are currently looking for people to work on and maintain this document. If you 
would like to contribute, please send an email to one of the authors. 





1.5 Fonts in this Manual 


In this manual, fonts are used as follows: 
¥ Typewriter font is used for programming examples. 
Bold font is used for emphasis. 
v UPPER CASE items may be either acronyms or register mode fields that can be 
written by software. Some common acronyms appear in the glossary. 
v¥ Square brackets [] indicate an addressed field in a register or a numbered register in a 
register file. 


IN 
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1.6 Conventions 





























1L.mnemonic Identifies an ORBIS32/64 instruction. 

lv.mnemonic Identifies an ORVDX32/64 instruction. 

1f.mnemonic Identifies an ORFPX32/64 instruction. 

Ox Indicates a hexadecimal number. 

rA Instruction syntax used to identify a general purpose register 

REG[FIELD] Syntax used to identify specific bit(s) of a general or special purpose 
register. FIELD can be a name of one bit or a group of bits or a 

numerical range constructed from two values separated by a colon. 

Xx In certain contexts, this indicates a ‘don't care’. 

N In certain contexts, this indicates an undefined numerical value. 

Implementation An actual processor implementing the OpenRISC 1000 architecture. 

Unit Sometimes referred to as a coprocessor. An implemented unit 

usually with some special registers and controlling instructions. It 
can be defined by the architecture or it may be custom. 

Exception A vectored transfer of control to supervisor software through an 

exception vector table. A way in which a processor can request 
operating system assistance (division by zero, TLB miss, external 
interrupt etc). 

Privileged An instruction (or register) that can only be executed (or accessed) 

when the processor is in Supervisor mode (when SR[SM]=1). 








Table 1-3. Conventions 


1.7 Numbering 


All numbers are decimal or hexadecimal unless otherwise indicated. The prefix Ox 
indicates a hexadecimal number. Decimal numbers don't have a special prefix. Binary and 
other numbers are marked with their base. 
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2 Architecture Overview 


This chapter introduces the OpenRISC 1000 architecture and describes the general 
architectural features. 


2.1 Features 


The OpenRISC 1000 architecture includes the following principal features: 
v¥ Acompletely free and open architecture. 


YA linear, 32-bit or 64-bit logical address space with implementation-specific physical 
address space. 


v¥ Simple and uniform-length instruction formats featuring different instruction set 
extensions: 


> OpenRISC Basic Instruction Set (ORBIS32/64) with 32-bit wide instructions 
aligned on 32-bit boundaries in memory and operating on 32- and 64-bit data 


> OpenRISC Vector/DSP eXtension (ORVDX64) with 32-bit wide instructions 
aligned on 32-bit boundaries in memory and operating on 8-, 16-, 32- and 64-bit 
data 


> OpenRISC Floating-Point eXtension (ORFPX32/64) with 32-bit wide instructions 
aligned on 32-bit boundaries in memory and operating on 32- and 64-bit data 


v¥ Two simple memory addressing modes, whereby memory address is calculated by: 
> addition of a register operand and a signed 16-bit immediate value 


> addition of a register operand and a signed 16-bit immediate value followed by 
update of the register operand with the calculated effective address 


v¥ Two register operands (or one register and a constant) for most instructions who then 
place the result in a third register 


v¥ Shadowed or single 32-entry or narrow 16-entry general purpose register file 
¥ Optional branch delay slot for keeping the pipeline as full as possible 


¥ Support for separate instruction and data caches/MMUs (Harvard architecture) or for 
unified instruction and data caches/MMUs (Stanford architecture) 


vA flexible architecture definition that allows certain functions to be performed either 
in hardware or with the assistance of implementation-specific software 


¥ Number of different, separated exceptions simplifying exception model 
vY Fast context switch support in register set, caches, and MMUs 


2.2 Introduction 


The OpenRISC 1000 architecture is a completely open architecture. It defines the 
architecture of a family of open source, RISC microprocessor cores. The OpenRISC 1000 
architecture allows for a spectrum of chip and system implementations at a variety of 
price/performance points for a range of applications. It is a 32/64-bit load and store RISC 
architecture designed with emphasis on performance, simplicity, low power requirements, 
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and scalability. OpenRISC 1000 targets medium and high performance networking and 
embedded computer environments. 


Performance features include a full 32/64-bit architecture; vector, DSP and 
floating-point instructions; powerful virtual memory support; cache coherency; optional 
SMP and SMT support, and support for fast context switching. The architecture defines 
several features for networking and embedded computer environments. Most notable are 
several instruction extensions, a configurable number of general-purpose registers, 
configurable cache and TLB sizes, dynamic power management support, and space for 
user-provided instructions. 


The OpenRISC 1000 architecture is the predecessor of a richer and more powerful 
next generation of OpenRISC architectures. 


The full source for implementations of the OpenRISC 1000 architecture is 
available at www.opencores.org and is supported with GNU software development tools 
and a behavioral simulator. Most OpenRISC implementations are designed to be modular 
and vendor-independent. They can be interfaced with other open-source cores available at 


Wwww.opencores.org. 

Opencores.org encourages third parties to design and market their own 
implementations of the OpenRISC 1000 architecture and to participate in further 
development of the architecture. 





2.3 Architecture Version Information 


It is anticipated that revisions of the OR1K architecture will come about as 
architectural modifications are made over time. This document shall be valid for the latest 
version stated in it. Each implementation should indicate the minimum revision it 
supports in the Architecture Version Register (AVR). 


The following table lists the versions and their release date. 














Version Date Summary 
0.0 November Initial architecture specification. 
2005 
1.0 December First version. 
2012 
1.1 April 2014 Atomic instructions additions. 
1.2 April 2015 SPRs in user mode, multicore. 











Table 2-1: Architecture Version Information 
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3 Addressing Modes and 
Operand Conventions 


This chapter describes memory-addressing modes and memory operand 
conventions defined by the OpenRISC 1000 system architecture. 


3.1 Memory Addressing Modes 


The processor computes an effective address when executing a memory access 
instruction or branch instruction or when fetching the next sequential instruction. If the 
sum of the effective address and the operand length exceeds the maximum effective 
address in logical address space, the memory operand wraps around from the maximum 
effective address through effective address 0. 


3.1.1 Register Indirect with Displacement 


Load/store instructions using this address mode contain a signed 16-bit immediate 
value, which is sign-extended and added to the contents of a general-purpose register 
specified in the instruction. 


instruction 









A 
Sign Extended Imm 









Effective Address 








Figure 3-1. Register Indirect with Displacement Addressing 


Figure 3-1 shows how an effective address is computed when using register 
indirect with displacement addressing mode. 


3.1.2 PC Relative 


Branch instructions using this address mode contain a signed 26-bit immediate 
value that is sign-extended and added to the contents of a Program Counter register. 
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Before the execution at the destination PC, instruction in delay slot is executed if the ND 
bit in CPU Configuration Register (CPUCFGR) is set. 









Instruction 





Sign Extended Imm 

















V 
Effective Address 


Figure 3-2. PC Relative Addressing 


Figure 3-2 shows how an effective address is generated when using PC relative 
addressing mode. 


3.2 Memory Operand Conventions 


The architecture defines an 8-bit byte, 16-bit halfword, a 32-bit word, and a 64-bit 
doubleword. It also defines IEEE-754 compliant 32-bit single precision float and 64-bit 
double precision float storage units. 64-bit vectors of bytes, 64-bit vectors of halfwords, 
64-bit vectors of singlewords, and 64-bit vectors of single precision floats are also 
defined. 


Type of Data Length in Bytes Length in Bits 























Byte 1 8 

Halfword (or half) 2 16 
Singleword (or word) 4 32 
Doubleword (or double) 8 64 
Single precision float 4 32 
Double precision float 8 64 
Vector of bytes 8 64 

Vector of halfwords 8 64 
Vector of singlewords 8 64 
Vector of single precision floats 8 64 








Table 3-1. Memory Operands and their sizes 
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3.2.1 Bit and Byte Ordering 


Byte ordering defines how the bytes that make up halfwords, singlewords and 
doublewords are ordered in memory. To simplify OpenRISC implementations, the 
architecture implements Most Significant Byte (MSB) ordering — or big endian byte 
ordering by default. But implementations can support Least Significant Byte (LSB) 
ordering if they implement byte reordering hardware. Reordering is enabled with bit 
SR[LEE]. 

The figures below illustrate the conventions for bit and byte numbering within 
various width storage units. These conventions hold for both integer and floating-point 
data, where the most significant byte of a floating-point value holds the sign and at least 
significant byte holds the start of the exponent. 


Table 3-2 shows how bits and bytes are ordered in a halfword. 


Bit 15 Bit 8 Bit 7 Bit O 





Byte address 0 Byte address 1 





Table 3-2. Default Bit and Byte Ordering in Halfwords 


Table 3-3 shows how bits and bytes are ordered in a singleword. 





MSB LSB 





| Byte address 0 | Byte address 1 | Byte address 2 Byte address 3 | 





Table 3-3. Default Bit and Byte Ordering in Singlewords and Single Precision Floats 


Table 3-4 shows how bits and bytes are ordered in a doubleword. 


























Bit 63 Bit 56 
MSB 
Byte address 0 Byte address 1 Byte address 2 Byte address 3 
Bit 7 Bit O 
LSB 
Byte address 4 Byte address 5 Byte address 6 Byte address 7 








Table 3-4. Default Bit and Byte Ordering in Doublewords, Double Precision Floats and all Vector 
Types 


3.2.2 Aligned and Misaligned Accesses 


A memory operand is naturally aligned if its address is an integral multiple of the 
operand length. Implementations might support accessing unaligned memory operands, 
but the default behavior is that accesses to unaligned operands result in an alignment 
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exception. See chapter Exception Model on page 254 for information on alignment 


exception. 


Current OR32 implementations (OR1200) do not implement 8 byte alignment, but 
do require 4 byte alignment. Therefore the Application Binary Interface (chapter 17) uses 
4 byte alignment for 8 byte types. Future extensions such as ORVDX64 may require 


natural alignment. 




















Operand Length addr[3:0] if aligned 
Byte 8 bits XXXX 
Halfword (or half) 2 bytes Xxx0 
Singleword (or word) 4 bytes Xx00 
Doubleword (or double) 8 bytes X000 
Single precision float 4 bytes Xx00 
Double precision float 8 bytes X000 
Vector of bytes 8 bytes X000 
Vector of halfwords 8 bytes X000 
Vector of singlewords 8 bytes X000 
Vector of single precision floats 8 bytes X000 





Table 3-5. Memory Operand Alignment 


OR32 instructions are four bytes long and word-aligned. 


www.openrisc.io 








1.2-1 








21 of 357 


OpenCores OpenRISC 1000 Architecture Manual August 20, 2017 


4 Register Set 


4.1 Features 


The OpenRISC 1000 register set includes the following principal features: 


¥ Thirty-two or sixteen 32/64-bit general-purpose registers — OpenRISC 1000 
implementations optimized for use in FPGAs and ASICs in embedded and similar 
environments may implement only the first sixteen of the possible thirty-two 
registers. 


vY All other registers are special-purpose registers defined for each unit separately 
and accessible through the |.mtspr/l.mfspr instructions. 


4.2 Overview 


An OpenRISC 1000 processor includes several types of registers: user level 
general-purpose and special-purpose registers, supervisor level special-purpose registers 
and unit-dependent registers. 


User level general-purpose and special-purpose registers are accessible both in 
user mode and supervisor mode of operation. Supervisor level special-purpose registers 
are accessible only in supervisor mode of operation (SR[SM]=1). 


Unit dependent registers are usually only accessible in supervisor mode but there 
can be exceptions to this rule. Accessibility for architecture-defined units is defined in 
this manual. Accessibility for custom units not covered by this manual will be defined in 
the appropriate implementation-specific manuals. 


4.3 Special-Purpose Registers 


The special-purpose registers of all units are grouped into thirty-two groups. Each 
group can have different register address decoding depending on the maximum 
theoretical number of registers in that particular group. A group can contain registers 
from several different units or processes. The SR[SM] bit is also used in register address 
decoding, as some registers are accessible only in supervisor mode. The |.mtspr and 
l.mfspr instructions are used for reading and writing registers. 


Unimplemented SPRs should read as zero. Writing to unimplemented SPRs will 
have no effect, and the |.mtspr instruction will effectively be a no-operation. 
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GROUP # UNIT DESCRIPTION 
0 System Control and Status registers 
1 Data MMU (in the case of a single unified MMU, groups 1 and 2 decode into a 
single set of registers) 
2 Instruction MMU (in the case of a single unified MMU, groups 1 and 2 decode 
into a single set of registers) 
3 Data Cache (in the case of a single unified cache, groups 3 and 4 decode into a 
single set of registers) 
4 Instruction Cache (in the case of a single unified cache, groups 3 and 4 decode 
into a single set of registers) 
5 MAC unit 
6 Debug unit 
7 Performance counters unit 
8 Power Management 
9 Programmable Interrupt Controller 
10 Tick Timer 
11 Floating Point unit 
12-23 Reserved for future use 
24-31 Custom units 














Table 4-1. Groups of SPRs 


An OpenRISC 1000 processor implementation is required to implement at least the 
special purpose registers from group 0. All other groups are optional, and registers from 
these groups are implemented only if the implementation has the corresponding unit. 
Which units are actually implemented may be determined by reading the UPR register 


from group 0. 


A 16-bit SPR address is made of 5-bit group index (bits 15-11) and 11-bit register 
index (bits 10-0). 












































Grp # Reg # Reg Name USER SUPV Description 
MODE MODE 

0 0 VR - R Version register 

0 1 UPR - R Unit Present register 

0 2 CPUCFGR - R CPU Configuration register 

0 3 DMMUCFGR - R Data MMU Configuration 
register 

0 4 IMMUCFGR - R Instruction MMU Configuration 
register 

0 5 DCCFGR - R Data Cache Configuration 
register 

0 6 ICCFGR - R Instruction Cache Configuration 
register 

0 DCFGR - R Debug Configuration register 

0 PCCFGR — R Performance Counters 

Configuration register 
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Grp # Reg # Reg Name USER SUPV Description 
MODE MODE 
0 9 VR2 - R Version register 2 
0 10 AVR - R Architecture version register 
0 11 EVBAR - R/W Exception vector base address 
register 
0 12 AECR - R/W Arithmetic Exception Control 
Register 
0 13 AESR - R/W Arithmetic Exception Status 
Register 
0 16 NPC - R/W PC mapped to SPR space (next 
PC) 
0 17 SR - R/W Supervision register 
0 18 PPC - R PC mapped to SPR space 
(previous PC) 
0 20 FPCSR R* R/W FP Control Status register 
0 21-28 ISRO-ISR7 R Implementation-specific 
registers 
0 32-47 | EPCRO-EPCR15 - R/W Exception PC registers 
0 48-63 | EEARO-EEAR15 - R/W Exception EA registers 
0 64-79 ESRO-ESR15 - R/W Exception SR registers 
0 128 COREID - R Core Identifier Register 
0 129 NUMCORES - R Number of Cores Register 
0 1024- GPRO-GPR511 - R/W GPRs mapped to SPR space 
1535 
A 0 DMMUCR - R/W Data MMU Control register 
1 1 DMMUPR - R/W Data MMU Protection Register 
1 2 DTLBEIR - Ww Data TLB Entry Invalidate 
register 
1 4-7 DATBMRO- - R/W Data ATB Match registers 
DATBMR3 
1 8-11 DATBTRO- - R/W Data ATB Translate registers 
DATBTR3 
1 512- DTLBWOMRO- - R/W Data TLB Match registers Way 
639 DTLBWOMR127 0 
1 640- DTLBWOTRO- - R/W Data TLB Translate registers 
767 DTLBWOTR127 Way 0 
1 768- DTLBW1MRO- - R/W Data TLB Match registers Way 
895 DTLBW1MR127 1 
1 896- DTLBW1TRO- - R/W Data TLB Translate registers 
1023 DTLBW1TR127 Way 1 
1 1024- DTLBW2MRO- - R/W Data TLB Match registers Way 
1151 DTLBW2MR127 2 
1 1152- DTLBW2TRO- - R/W Data TLB Translate registers 
1279 DTLBW2TR127 Way 2 
1 1280- DTLBW3MRO- - R/W Data TLB Match registers Way 
1407 DTLBW3MR127 3 
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Grp # Reg # Reg Name USER SUPV Description 
MODE MODE 
AE 1408- DTLBW3TRO- - R/W Data TLB Translate registers 
1535 DTLBW3TR127 Way 3 
2 0 IMMUCR - R/W Instruction MMU Control 
register 
2 1 IMMUPR - R/W Instruction MMU Protection 
Register 
2 2 ITLBEIR - Ww Instruction TLB Entry Invalidate 
register 
2 4-7 IATBMRO- - R/W Instruction ATB Match registers 
IATBMR3 
2 8-11 IATBTRO- - R/W Instruction ATB Translate 
IATBTR3 registers 
2 512- ITLBWOMRO- - R/W Instruction TLB Match registers 
639 ITLBWOMR127 Way 0 
2 640- ITLBWOTRO- - R/W Instruction TLB Translate 
767 ITLBWOTR127 registers Way 0 
2 768- ITLBW1MRO- - R/W Instruction TLB Match registers 
895 ITLBW1MR127 Way 1 
2 896- ITLBW1TRO- - R/W Instruction TLB Translate 
1023 ITLBW1TR127 registers Way 1 
2 1024- ITLBW2MRO- - R/W Instruction TLB Match registers 
1151 ITLBW2MR127 Way 2 
2 1152- ITLBW2TRO- - R/W Instruction TLB Translate 
1279 ITLBW2TR127 registers Way 2 
2 1280- ITLBW3MRO- - R/W Instruction TLB Match registers 
1407 ITLBW3MR127 Way 3 
2 1408- ITLBW3TRO- - R/W Instruction TLB Translate 
1535 ITLBW3TR127 registers Way 3 
3 0 DCCR - R/W DC Control register 
3 1 DCBPR Ww Ww DC Block Prefetch register 
3 2 DCBFR Ww Ww DC Block Flush register 
3 3 DCBIR - Ww DC Block Invalidate register 
3 4 DCBWR Ww Ww DC Block Write-back register 
3 5 DCBLR Ww Ww DC Block Lock register 
4 0 ICCR - R/W IC Control register 
4 1 ICBPR Ww Ww IC Block Prefetch register 
4 2 ICBIR - Ww IC Block Invalidate register 
4 3 ICBLR Ww Ww IC Block Lock register 
5 1 MACLO R/W* R/W* MAC Low 
5 2 MACHI R/W* R/W* MAC High 
6 0-7 DVRO-DVR7 - R/W Debug Value registers 
6 8-15 DCRO-DCR7 - R/W Debug Control registers 
6 16 DMR1 - R/W Debug Mode register 1 
6 17 DMR2 - R/W Debug Mode register 2 
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Grp # Reg # Reg Name USER SUPV Description 
MODE MODE 

6 18-19 | DCWRO-DCWR1 - R/W Debug Watchpoint Counter 
registers 

6 20 DSR - R/W Debug Stop register 

6 21 DRR - R/W Debug Reason register 

7 0-7 PCCRO-PCCR7 R* R/W Performance Counters Count 
registers 

7 8-15 PCMRO-PCMR7 - R/W Performance Counters Mode 
registers 

8 0 PMR - R/W Power Management register 

9 0 PICMR - R/W PIC Mask register 

9 2 PICSR - R/W PIC Status register 

10 0 TTMR - R/W Tick Timer Mode register 

10 1 TTCR R* R/W Tick Timer Count register 
































Table 4-2. List of All Special-Purpose Registers 


SPRs with R* for user mode access are readable in user mode if SR[SUMRA] is 
set. 


The MACLO and MACHI registers are synchronized, such that any ongoing MAC 
operation finishes before they are read or written. 


4.4 General-Purpose Registers (GPRs) 


The thirty-two general-purpose registers are labeled RO-R31 and are 32 bits wide 
in 32-bit implementations and 64 bits wide in 64-bit implementations. They hold scalar 
integer data, floating-point data, vectors or memory pointers. Table 4-3 contains a list of 
general-purpose registers. The GPRs may be accessed as both source and destination 
registers by ORBIS, ORVDX and ORFPX instructions. 


See chapter Application Binary Interface on page 332 for information on floating- 
point data types. See also Register Usage on page 335, where r9 is defined as the Link 
Register. 























Register r31 r30 
Register 129 128 127 126 125 124 
Register 123 122 r21 120 r19 r18 
Register r17 r16 r15 114 13 r12 
Register rit r10 r9 LR r8 r7 r6 
Register r5 r4 r3 r2 ri r0 



































Table 4-3. General-Purpose Registers 


RO should always hold a zero value. It is the responsibility of software to initialize 
it. (This differs from architecture version 0 which commented on implementation and that 
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it should never be used as a destination register — this is no longer specified.) Functions 
of other registers are explained in chapter Application Binary Interface on page 332. 


An implementation may have several sets of GPRs and use them as shadow 
registers, switching between them whenever a new exception occurs. The current set is 
identified by the SR[CID] value. 

An implementation is not required to initialize GPRs to zero during the reset 
procedure. The reset exception handler is responsible for initializing GPRs to zero if that 
is necessary. 


4.5 Support for Custom Number of GPRs 


Programs may be compiled with less than thirty-two registers. Unused registers are 
disabled (set as fixed registers) when compiling code. Such code is also executable on 
normal implementations with thirty-two registers but not vice versa. This feature is quite 
useful since users are expected to move from less powerful OpenRISC implementations 
with less than thirty-two registers to more powerful thirty-two register OpenRISC 
implementations. 


If configuration registers are implemented, CPUCFGR[CGF] indicates whether 
implementation has complete thirty-two general-purpose registers or less than thirty-two 
registers. OR1200 has been implemented with 16 or 32 registers. 


4.6 Supervision Register (SR) 


The Supervison register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode only. 


The SR value defines the state of the processor. 









































Bit 31-28 27-17 16 
Identifier CID Reserved SUMRA 
Reset 0 0 0 
RIW R/W Read Only R/W 
Bit 15 14 13 12 11 10 9 8 
Identifier FO EPH DSX OVE OV CY F CE 
Reset 1 0 0 0 0 0 0 0 
RIW R R/W R/W R/W R/W R/W R/W R/W 
Bit 7 6 5 4 3 2 1 0 
Identifier LEE IME DME ICE DCE IEE TEE SM 
Reset 0 0 0 0 0 0 0 1 
RIW R/W R/W R/W R/W R/W R/W R/W R/W 
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SM Supervisor Mode 
O Processor is in User Mode 
1 Processor is in Supervisor Mode 
TEE Tick Timer Exception Enabled 
O Tick Timer Exceptions are not recognized 
1 Tick Timer Exceptions are recognized 





IEE Interrupt Exception Enabled 
O Interrupts are not recognized 
1 Interrupts are recognized 

DCE Data Cache Enable 
0 Data Cache is not enabled 
1 Data Cache is enabled 





ICE Instruction Cache Enable 
O Instruction Cache is not enabled 
1 Instruction Cache is enabled 
DME Data MMU Enable 
0 Data MMU is not enabled 
1 Data MMU is enabled 
IME Instruction MMU Enable 
0 Instruction MMU is not enabled 
1 Instruction MMU is enabled 
LEE Little Endian Enable 
0 Little Endian (LSB) byte ordering is not enabled 
1 Little Endian (LSB) byte ordering is enabled 
CE CID Enable 
0 CID disabled and shadow registers disabled 
1 CID automatic increment and shadow registers enabled 
F Flag 
0 Conditional branch flag was cleared by sfXX instructions 
1 Conditional branch flag was set by sfXX instructions 
CY Carry flag 
O No carry out produced by last arithmetic operation 
1 Carry out was produced by last arithmetic operation 
OV Overflow flag 
O No overflow occured during last arithmetic operation 
1 Overflow occured during last arithmetic operation 








OVE Overflow flag Exception 
0 Overflow flag does not cause an exception 
1 Overflow flag causes range exception 

DSX Delay Slot Exception 
0 EPCR points to instruction not in the delay slot 

1 EPCR points to instruction in delay slot 
EPH Exception Prefix High 

0 Exceptions vectors are located in memory area starting at OxO 
1 Exception vectors are located in memory area starting at OxFOO00000 
FO Fixed One 
This bit is always set 
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SUMRA SPRs User Mode Read Access 
0 All SPRs are inaccessible in user mode 
1 Certain SPRs can be read in user mode 





CID Context ID (Fast Context Switching (Optional), page 257) 
0-15 Current Processor Context 





Table 4-4. SR Field Descriptions 


4.7 Exception Program Counter Registers 
(EPCRO - EPCR15) 


The Exception Program Counter registers are special-purpose supervisor-level 
registers accessible with the |.mtspr/l.mfspr instructions in supervisor mode. Read access 
in user mode is possible if it is enabled in PCMRx[SUMRA|]. They are 32-bit wide 
registers in 32-bit implementations and can be wider than 32 bits in 64-bit 
implementations. 


After an exception, the EPCR is set to the program counter address (PC) of the 
instruction that was interrupted by the exception. If only one EPCR is present in the 
implementation (Fast Context Switching (Optional) disabled), it must be saved by the 
exception handler routine before exception recognition is re-enabled in the SR. 

















Bit 31-0 
Identifier EPC 
Reset 0 
R/IW R/W 
EPC Exception Program Counter Address 








Table 4-5. EPCR Field Descriptions 


4.8 Exception Effective Address Registers 
(EEARO-EEAR15) 


The Exception Effective Address registers are special-purpose supervisor-level 
registers accessible with the |.mtspr/l.mfspr instructions in supervisor mode. Read access 
in user mode is possible if it is enabled in SRISUMRA]. The EEARs are 32-bit wide 
registers in 32-bit implementations and can be wider than 32 bits in 64-bit 
implementations. 


After an exception, the EEAR is set to the effective address (EA) generated by the 
faulting instruction. If only one EEAR is present in the implementation, it must be saved 
by the exception handler routine before exception recognition is re-enabled in the SR. 
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Bit 31-0 
Identifier EEA 
Reset 0 
R/IW R/W 
EEA Exception Effective Address 





Table 4-6. EEAR Field Descriptions 


4.9 Exception Supervision Registers 
(ESRO-ESR15) 


The Exception Supervision registers are special-purpose supervisor-level registers 
accessible with |.mtspr/l.mfspr instructions in supervisor mode. They are 32 bits wide 
registers in 32-bit implementations and can be wider than 32 bits in 64-bit 
implementations. 


After an exception, the Supervision register (SR) is copied into the ESR. If only one 


ESR is present in the implementation, it must be saved by the exception handler routine 
before exception recognition is re-enabled in the SR. 


























Bit 31-0 
Identifier ESR 
Reset 0 
R/IW R/W 
ESR Exception SR 








Table 4-7. ESR Field Descriptions 


4.10Core Identification Registers (COREID and 
NUMCORES) 


The Core Identification registers are special-purpose registers used in multicore 
platform configurations. They are 32 bit wide registers in 32-bit implementations and can 
be wider than 32 bits in 64-bit implementations. 


The first core is indexed with 0. 


4.11Next and Previous Program Counter 
(NPC and PPC) 


The Program Counter registers represent the address just executed and the address 
instruction just to be executed. 


These and the GPR registers mapped into SPR space should only be used for 
debugging purposes by an external debugger. Applications should use the |.jal instruction 
to obtain the current program counter and arithmethic instructions to obtain GPR register 
values. 
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4.12Floating Point Control Status Register 
(FPCSR) 


Floating point control status register is a 32-bit special-purpose register accessible 
with the |.mtspr/l.mfspr instructions in supervisor mode and as read-only register in user 
mode if enabled in SR[SUMRA]. 


The FPCSR value controls floating point rounding modes, optional generation of 
floating point exception and provides floating point status flags. Status flags are updated 
after every floating point instruction is completed and can serve to determine what caused 
the floating point exception. 


If floating point exception is enabled then FPCSR status flags have to be cleared in 
floating point exception handler. Status flags are cleared by writing 0 to all status bits. 




















Bit 31-12 11 10 9 8 
Identifier Reserved DZF INF IVF IXF 
Reset 0 0 0 0 0 
RIW Read Only R/W R/W R/W R/W 
Bit 7 6 5 4 3 2-1 0 
Identifier ZF QNF SNF UNF OVF RM FPEE 
Reset 0 0 0 0 0 0 0 
RIW R/W R/W R/W R/W R/W R/W R/W 
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RM 


Rounding Mode 
O Round to nearest 
1 Round to zero 
2 Round to infinity+ 
3 Round to infinity- 





OVF 


OVerflow Flag 
O No overflow 
1 Result overflowed 





UNF 


SNF 


UNderflow Flag 
O No underflow 
1 Result underflowed 
SNAN Flag 
O Result not SNAN 
1 Result SNAN 





QNF 


QNAN Flag 
O Result not ONAN 
1 Result QNAN 





ZF 


IXF 


Zero Flag 
O Result not zero 
1 Result zero 
IneXact Flag 
O Result precise 
1 Result inexact 





IVF 


InValid Flag 
O Result valid 
1 Result invalid 





INF 


DZF 








INfinity Flag 
0 Result finite 
1 Result infinite 
Divide by Zero Flag 
O Proper divide 
1 Divide by zero 
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5 Instruction Set 


This chapter describes the OpenRISC 1000 instruction set. 


5.1 Features 


The OpenRISC 1000 instruction set includes the following principal features: 
v¥ Simple and uniform-length instruction formats featuring five Instruction Subsets 


¥ OpenRISC Basic Instruction Set (ORBIS32/64) with 32-bit wide instructions aligned 
on 32-bit boundaries in memory and operating on 32-bit and 64-bit data 


¥ OpenRISC Vector/DSP eXtension (ORVDX64) with 32-bit wide instructions aligned 
on 32-bit boundaries in memory and operating on 8-, 16-, 32- and 64-bit data 


¥ OpenRISC Floating-Point eXtension (ORFPX32/64) with 32-bit wide instructions 
aligned on 32-bit boundaries in memory and operating on 32-bit and 64-bit data 


v Reserved opcodes for custom instructions 


Note: Instructions are divided into instruction classes. Only the basic classes are 
required to be implemented in an OpenRISC 1000 implementation. 


ORVDX64 
ORBIS32 
nstruction Set 
ORBIS64 ORFPX64 


Figure 5-1. Instruction Set 






5.2 Overview 
OpenRISC 1000 instructions belong to one of the following instruction subsets: 

¥ ORBIS32: 

> 32-bit integer instructions 
Basic DSP instructions 
32-bit load and store instructions 
Program flow instructions 
Special instructions 


VV V WV 
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¥ ORBIS64: 
> 64-bit integer instructions 
> 64-bit load and store instructions 
¥ ORFPX32: 
> Single-precision floating-point instructions 
¥ ORFPX64: 
> Double-precision floating-point instructions 
> 64-bit load and store instructions 
¥ ORVDX64: 
> Vector instructions 
DSP instructions 


Vv 


Instructions in each subset are also split into two instruction classes according to 
implementation importance: 


v Class I 
Y Class II 





| Class | Description 
| | | Instructions in class | must always be implemented. 








H Instructions from class II are optional and an implementation may choose to 
use some or all instructions from this class based on requirements of the 
target application. 








Table 5-1. OpenRISC 1000 Instruction Classes 
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l.add Add l.add 
31 |i] l/l }] 265 [hii |} 2nj20 hihi |) 165 ff. Ji. |) || ato 9 g7 |i. ||. |] 443 If. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x0 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 


5.3 ORBIS32/64 


Format: 
l.add rD,rA,rB 
Description: 


The contents of general-purpose register rA are added to the contents of general-purpose 
register rB to form the result. The result is placed into general-purpose register rD. 


The instruction will set the carry flag on unsigned overflow, and the overflow flag on 
signed overflow. 


32-bit Implementation: 

rD[31:0] < rA[31:0] + rB[31:0] 

SR[CY ] - carry (unsigned overflow) 
SR[OV] - Signed overflow 

64-bit Implementation: 

rD[63:0] < rA[63:0] + rB[63:0] 

SR[CY ] - carry (unsigned overflow) 
SR[OV] <- Signed overflow 
Exceptions: 


Range Exception onoverflow if SR[OVE] and AECR[OVADDE] are set. 
Range Exception oncarryif SR[OVE] and AECR[CYADDE] are set. 


Instruction Class 
ORBIS32 I 
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l.addc Add and Carry l.addc 
31 if |). fe. |) 2625 Ji ih |) 2120 |) |b JL || aes |i]. |. |) 120 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x1 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 
Format: 


l.addc rD,rA,rB 

Description: 

The contents of general-purpose register rA are added to the contents of general-purpose 
register rB and carry SR[CY] to form the result. The result is placed into general-purpose 


register rD. 


The instruction will set the carry flag on unsigned overflow, and the overflow flag on 
signed overflow. 


32-bit Implementation: 

rD[31:0] < rA[31:0] + rB[31:0] + SR[CY] 
SR[CY] - carry (unsigned overflow) 
SR[OV] - Signed overflow 

64-bit Implementation: 

rD[63:0] < rA[63:0] + rB[63:0] + SR[CY] 
SR[CY ] - carry (unsigned overflow) 
SR[OV] - overflow 

Exceptions: 


Range Exception onoverflow if SR[OVE] and AECR[OVADDE] are set. 
Range Exception oncarryif SR[OVE] and AECR[CYADDE] are set. 


Instruction Class 
ORBIS32 I 
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l.addi Add Immediate l.addi 
31 Me | fe Hilse 26 25 |). |e 2120 Ji). |i. 1615 J. |e fe He Pe Te Te Te He Te Te FP Te Te 0 
opcode 0x27 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1l.addi rD,rA,I 

Description: 

The immediate value is sign-extended and added to the contents of general-purpose 
register rA to form the result. The result is placed into general-purpose register rD. 

The instruction will set the carry flag on unsigned overflow, and the overflow flag on 
signed overflow. 

32-bit Implementation: 

rD[31:0] < rA[31:0] + exts(Immediate) 

SR[CY] - carry (unsigned overflow) 

SR[OV] - Signed overflow 

64-bit Implementation: 

rD[63:0] < rA[63:0] + exts(Immediate) 

SR[CY ] - carry (unsigned overflow) 

SR[OV] - signed overflow 

Exceptions: 


Range Exception on overflow if SR[OVE] and AECR[OVADDE] are set. 
Range Exception oncarryif SR[OVE] and AECR[CYADDE] are set. 


Instruction Class 
ORBIS32 I 
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laddic Add Immediate and Carry l.addic 
31 «lee [ose 2625 eile [fe 2120 » {le [fe W615 ff. fl. ye ye Te Te Tye Te He Te Te Te He 0) 
opcode 0x28 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


l.addic rD,rA,I 

Description: 

The immediate value is sign-extended and added to the contents of general-purpose 
register rA and carry SR[CY] to form the result. The result is placed into general-purpose 
register rD. 


The instruction will set the carry flag on unsigned overflow, and the overflow flag on 
signed overflow. 


32-bit Implementation: 

rD[31:0] < rA[31:0] + exts(Immediate) + SR[CY] 
SR[CY ] - carry (unsigned overflow) 

SR[OV] - Signed overflow 

64-bit Implementation: 

rD[63:0] < rA[63:0] + exts(Immediate) + SR[CY] 
SR[CY] - carry (unsigned overflow) 

SR[OV ] - Signed overflow 

Exceptions: 


Range Exception onoverflow if SR[OVE] and AECR[OVADDE] are set. 
Range Exception oncarryif SR[OVE] and AECR[CYADDE] are set. 


Instruction Class 
ORBIS32 I 
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land And land 
31 if |). |e. |) 2625 Ji h h |) 2120 |) |h JL || aes |i]. |. |) 120 9 87 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x3 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits A bits A bits 
Format: 


l.and rD,rA,rB 

Description: 

The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical AND operation. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] AND rB[31:0] 

64-bit Implementation: 

rD[63:0] < rA[63:0] AND rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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].andi And with Immediate Half Word ].andi 
31 Me | fe Ml 26 25 |). |e 2120 |). fl. 1615 J. |e fe He Te Te Te Te He Te Te FP Te Te 0) 
opcode 0x29 D A K 
6 bits 5 bits 5 bits 16 bits 
Format: 


l.andi rD,rA,K 

Description: 

The immediate value is zero-extended and combined with the contents of general- 
purpose register rA in a bit-wise logical AND operation. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] AND extz(Immediate) 

64-bit Implementation: 

rD[63:0] < rA[63:0] AND extz(Immediate) 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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l.bf Branch if Flag L.bf 
310. JJ. Te TP TP ease Te Pe Te TPR Te TPR Te Te Te FR Te TE TP TE TPT TE TP Te Te Te Te TTPO 
opcode 0x4 N 
6 bits 26 bits 
Format: 
1. bf N 
Description: 


The immediate value is shifted left two bits, sign-extended to program counter width, and 
then added to the address of the branch instruction. The result is the effective address of 
the branch. If the flag is set, the program branches to EA. If CPUCFGR[ND] is not set, 
the branch occurs with a delay of one instruction. 

32-bit Implementation: 


EA - exts(Immediate << 2) + BranchInsnAddr 
PC ~ EA if SR[F] set 


64-bit Implementation: 


EA - exts(Immediate << 2) + BranchInsnAddr 
PC ~ EA if SR[F] set 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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l.bnf Branch if No Flag l.bnf 
310. JJ. Ie Pe TP ease Te Pe Te TPR Te TPR Te TER Te FE TP TE TPR TE TTP TE TP Te Te TT TET 0 
opcode 0x3 N 
6 bits 26 bits 
Format: 
l.bnf N 
Description: 


The immediate value is shifted left two bits, sign-extended to program counter width, and 
then added to the address of the branch instruction. The result is the effective address of 
the branch. If the flag is cleared, the program branches to EA. If CPUCFGR[ND] is not 
set, the branch occurs with a delay of one instruction. 

32-bit Implementation: 


EA - exts(Immediate << 2) + BranchInsnAddr 
PC - EA if SR[F] cleared 


64-bit Implementation: 


EA + exts(Immediate << 2) + BranchInsnAddr 
PC ~- EA if SR[F] cleared 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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l.cmov Conditional Move l.cmov 
31 |. fei. {fe {} 2625 |h\k\} |] 21/20 |) 4h] ||) 265 ||) || azo 9 giv ||. ||. |) 4 |}. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode Oxe 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits A bits A bits 
Format: 


1.cmov rD,rA,rB 
Description: 


If SR[F] is set, general-purpose register rA is placed in general-purpose register rD. If 
SRIF] is cleared, general-purpose register rB is placed in general-purpose register rD. 


32-bit Implementation: 

rD[31:0] < SR[F] ? rA[31:0] : rB[31:0] 
64-bit Implementation: 

rD[63:0] < SR[F] ? rA[63:0] : rB[63:0] 
Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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L.csync Context Synchronization l.csyne 
opcode 0x23000000 
32 bits 
Format: 
1.csync 
Description: 


Execution of context synchronization instruction results in completion of all operations 
inside the processor and a flush of the instruction pipelines. When all operations are 
complete, the RISC core resumes with an empty instruction pipeline and fresh context in 
all units (MMU for example). 

32-bit Implementation: 

context-synchronization 

64-bit Implementation: 

context-synchronization 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Reserved for ORBIS32/64 Custom 


l.cust1 : l.cust1 
Instructions 
31 J tf. ff. fe 26250 ff. Le He Te Te Te TER Te Te Ye Ye Te FR TR TT TR TTR TER TL LE 
opcode Oxic reserved 
6 bits 26 bits 
Format: 
1l.custi 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 


www.opencores.org 1.2-1 45 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Reserved for ORBIS32/64 Custom 


l.cust2 : l.cust2 
Instructions 
31 go [len exe Me 2625 ff. HP. He JL. ee Te Te Te TR EL TER TR TTT TT TR TR TTI TER TTP 
opcode Ox1d reserved 
6 bits 26 bits 
Format: 
1.cust2 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 


www.opencores.org 1.2-1 46 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Reserved for ORBIS32/64 Custom 


l.cust3 ‘ l.cust3 
Instructions 
31 wtf. ff. fe 2625 ff. Le He Te Te Te TER Te Te Ye eT Te TR TR TT Tee TT TE TR TL LE 
opcode Ox1e reserved 
6 bits 26 bits 
Format: 
1.cust3 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 
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Reserved for ORBIS32/64 Custom 


l.cust4 : l.cust4 
Instructions 
31 je Tre}. Te YP 26S. TP Te Te Te TTP Te TE TR TR Te TR TT TT, TRH TR TP TTR 
opcode 0x1f reserved 
6 bits 26 bits 
Format: 
1l.cust4 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 
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Reserved for ORBIS32/64 Custom 


l.cust5 ‘ l.cust5 
Instructions 
31 edt. de de 2625 mam be 2120 eff. Te 1615 . {te |i. 1110 Prat 54 effete 0 
opcode 0x3c D A B L K 
6 bits 5 bits 5 bits 5 bits 6 bits 5 bits 
Format: 


1.cust5 rD,rA,rB,L,K 

Description: 

This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 
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Reserved for ORBIS32/64 Custom 


l.cust6 ‘ l.cust6 
Instructions 
31 go [len Mex! fie 2625 ff. HP. Te IL. ee Te Te Te TR EL TER TR TP, TT TT TR Te TT TR TTP 
opcode 0x3d reserved 
6 bits 26 bits 
Format: 
1.cust6 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 
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Reserved for ORBIS32/64 Custom 


l.cust7 : l.cust7 
Instructions 
31 J ff. ff. He 2625 ff. Le He Te Te TL TER Te Te Ye Ye Te Te FR TR TT TR TT TR TR TEE HE 
opcode 0x3e reserved 
6 bits 26 bits 
Format: 
1l.cust7 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 
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Reserved for ORBIS32/64 Custom 


l.cust8 ‘ l.cust8 
Instructions 
31 JTree Te] 26S. TT Te Pe TTR Te TE TTR Te TTR TTT FLT TR TP TRE 0 
opcode 0x3f reserved 
6 bits 26 bits 
Format: 
1.cust8 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but rather by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORBIS32 II 
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l.div Divide Signed l.div 
31 if |) fe. |) 2625 Ji ih |) 2120 |) |e /L || aes |i]. |. |) 120 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x3 reserved opcode 0x9 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 

Format: 


1.div rD,rA,rB 

Description: 

The content of general-purpose register rA are divided by the content of general-purpose 
register rB, and the result is placed into general-purpose register rD. Both operands are 


treated as signed integers. 


On divide-by zero, rD will be undefined, and the overflow flag will be set. Note that prior 
revisions of the manual (pre-2011) stored the divide by zero flag in SR[CY]. 


32-bit Implementation: 


rD[31:0] < rA[31:0] / rB[31:0] 
SR[OV] - rB[31:0] == 


64-bit Implementation: 


rD[63:0] < rA[63:0] / rB[63:0] 
SR[OV] - rB[63:0] == 


Exceptions: 


Range Exception when divisor is zero if SR[OVE] and AECR[DBZE] are set. 


Instruction Class 
ORBIS32 II 
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l.divu Divide Unsigned l.divu 
31 i. |e Jie |e {| 26125 |h\k{} |] 21/20 |) 4h] ||) 165 /./). |) || azo 9 aly ||. ||.) 48 |}. |. 0 
opcode 0x38 D A B reserved opcode 0x3 reserved opcode Oxa 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 
Format: 


1.divu rD,rA,rB 
Description: 
The content of general-purpose register rA are divided by the content of general-purpose 


register rB, and the result is placed into general-purpose register rD. Both operands are 
treated as unsigned integers. 


On divide-by zero, rD will be undefined, and the overflow flag will be set. 
32-bit Implementation: 


rD[31:0] < rA[31:0] / rB[31:0] 
SR[CY ] - rB[31:0] == 


64-bit Implementation: 


rD[63:0] < rA[63:0] / rB[63:0] 
SR[CY] - rB[63:0] == 


Exceptions: 


Range Exception when divisor is zero if SR[OVE] and AECR[DBZE] are set. 


Instruction Class 
ORBIS32 II 
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l.extbs Extend Byte with Sign l.extbs 
31 |j.|). |). |. || 2625 |].|f |).|) 21/20 |/.)).]).)} 165 | hI. |, ||) 109 Palle 65 43 ale 0 
opcode 0x38 D A reserved opcode 0x1 reserved opcode Oxc 
6 bits 5 bits 5 bits 6 bits 4 bits 2 bits 4 bits 
Format: 


l.extbs rD,rA 

Description: 

Bit 7 of general-purpose register rA is placed in high-order bits of general-purpose 
register rD. The low-order eight bits of general-purpose register rA are copied into the 
low-order eight bits of general-purpose register rD. 


32-bit Implementation: 


rD[31:8] < rA[7] 
rD[7:0] < rA[7:0] 


64-bit Implementation: 


rD[63:8] < rA[7] 
rD[7:0] < rA[7:0] 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lextbz Extend Byte with Zero L.extbz 
31 ff. |i. |]. J] |} 2625 fei fe || 2120 Jf. |) |. |} 65 |i fei t/|| 1019 Palle 65 43 . |i. 0 
opcode 0x38 D A reserved opcode 0x3 reserved opcode Oxc 
6 bits 5 bits 5 bits 6 bits 4 bits 2 bits 4 bits 

Format: 


1l.extbz rD,rA 

Description: 

Zero is placed in high-order bits of general-purpose register rD. The low-order eight bits 
of general-purpose register rA are copied into the low-order eight bits of general-purpose 
register rD. 


32-bit Implementation: 


rD[31:8] < 0 
rD[7:0] < rA[7:0] 


64-bit Implementation: 


rD[63:8] < 0 
rD[7:0] < rA[7:0] 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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l.exths Extend Half Word with Sign l.exths 
31 oj. |). |). ||. || 2625 ||.) |). |) 21/20 |/.)).]).)} 165 | hI. |, ||) 109 elie 65 43 ale 0 
opcode 0x38 D A reserved opcode 0x0 reserved opcode Oxc 
6 bits 5 bits 5 bits 6 bits 4 bits 2 bits 4 bits 
Format: 


l.exths rD,rA 

Description: 

Bit 15 of general-purpose register rA is placed in high-order bits of general-purpose 
register rD. The low-order 16 bits of general-purpose register rA are copied into the low- 
order 16 bits of general-purpose register rD. 


32-bit Implementation: 


rD[31:16] < rA[15] 
rD[15:0] < rA[15:0] 


64-bit Implementation: 


rD[63:16] < rA[15] 
rD[15:0] < rA[15:0] 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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L.exthz Extend Half Word with Zero L.exthz 
31 ff. |i. |]. J] |} 2625 fei fe || 2120 Jf. |) i. |} 1605 |e Heh TP || 1019 Palle 65 43 . |i. 0 
opcode 0x38 D A reserved opcode 0x2 reserved opcode Oxc 
6 bits 5 bits 5 bits 6 bits A bits 2 bits A bits 

Format: 


1l.exthz rD,rA 

Description: 

Zero is placed in high-order bits of general-purpose register rD. The low-order 16 bits of 
general-purpose register rA are copied into the low-order 16 bits of general-purpose 
register rD. 


32-bit Implementation: 


rD[31:16] < 0 
rD[15:0] < rA[15:0] 


64-bit Implementation: 


rD[63:16] < 0 
rD[15:0] < rA[15:0] 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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lL.extws Extend Word with Sign l.extws 
31 ff. J). |). ff |f 2625 |e] |}. || 2120 |i |) i) aes |). efi ||. |} 10l9 . ||. 65 43 Pall be 0 
opcode 0x38 D A reserved opcode 0x0 reserved opcode Oxd 
6 bits 5 bits 5 bits 6 bits 4 bits 2 bits 4 bits 
Format: 


l.extws rD,rA 

Description: 

Bit 31 of general-purpose register rA is placed in high-order bits of general-purpose 
register rD. The low-order 32 bits of general-purpose register rA are copied from low- 
order 32 bits of general-purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] 

64-bit Implementation: 


rD[63:32] < rA[31] 
rD[31:0] < rA[31:0] 


Exceptions: 


None 


Instruction Class 
ORBIS64 II 
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Lextwz Extend Word with Zero l.extwz 
31 ff. J). |). ff |f 2625 |e] |}. ]| 2120 |i |) i |) aes |). Jeti ||.) 10/9 . ||. 65 43 Pall fe 0 
opcode 0x38 D A reserved opcode 0x1 reserved opcode Oxd 
6 bits 5 bits 5 bits 6 bits A bits 2 bits A bits 

Format: 


1l.extwz rD,rA 

Description: 

Zero is placed in high-order bits of general-purpose register rD. The low-order 32 bits of 
general-purpose register rA are copied into the low-order 32 bits of general-purpose 
register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] 

64-bit Implementation: 


rD[63:32] < 0 
rD[31:0] < rA[31:0] 


Exceptions: 


None 


Instruction Class 
ORBIS64 II 
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L.ff1 Find First 1 1.ff1 
31 if). ef. ]) 2625 |i Jil || 2120 |i }h {i |} 16445 |) |). |). |) 1110 9 Bly ||. |). || 418 |}. |]. 0 
opcode 0x38 D A reserved reserved opcode 0x0 reserved opcode Oxf 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 

Format: 
1.ff1 rD,rA 
Description: 


Position of the lowest order '1' bit is written into general-purpose register rD. Checking 
for bit '1' starts with bit 0 (LSB), and counting is incremented for every zero bit. If first '1' 
bit is discovered in LSB, one is written into rD, if first '1' bit is discovered in MSB, 32 
(64) is written into rD. If there is no '1' bit, zero is written in rD. 

32-bit Implementation: 

rD[31:0] « rA[O] 2? 1: rA[1] ? 2... rA[31] ? 32 : 0 

64-bit Implementation: 

rD[63:0] « rA[O] 2? 1: rA[1] ? 2... rA[63] ? 64: 0 
Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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1.fl1 Find Last 1 1.fl1 
31 if |). |. Jf. |) 2625 |i Ji Ji] 2120 |i ]}h {|} 1645 |) |). |). |) 110 9 aly ||. |]. | 418 |}. |]. 0 
opcode 0x38 D A reserved reserved opcode 0x1 reserved opcode Oxf 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 
Format: 
1.f11 rD,rA 
Description: 


Position of the highest order '1' bit is written into general-purpose register rD. Checking 
for bit '1' starts with bit 31/63 (MSB), and counting is decremented for every zero bit 
until the last ‘1’ bit is found nearing the LSB. If highest order '1' bit is discovered in 
MSB, 32 (64) is written into rD, if highest order '1' bit is discovered in LSB, one is 
written into rD. If there is no '1' bit, zero is written in rD. 

32-bit Implementation: 

rD[31:0] « rA[31] ? 32 : rA[30] ? 31... rA[O] 2? 1: 0 

64-bit Implementation: 

rD[63:0] « rA[63] ? 64 : rA[62] ? 63... rA[O] 2? 1: 0 
Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lj Jump Lj 
310. JJ. Ie Pe TP ease Te Pe Te TP. Te TPR Te He Te FR TP TE TP TE TPT TP TP Te Te TE Te TTP 0 
opcode 0x0 N 
6 bits 26 bits 
Format: 
1.jN 
Description: 


The immediate value is shifted left two bits, sign-extended to program counter width, and 
then added to the address of the jump instruction. The result is the effective address of the 
jump. The program unconditionally jumps to EA. If CPUCFGR[ND] is not set, the jump 
occurs with a delay of one instruction. 


Note that l.sys should not be placed in the delay slot after a jump. 
32-bit Implementation: 
PC - exts(Immediate << 2) + JumpInsnAddr 
64-bit Implementation: 
PC «+ exts(Immediate << 2) + JumpInsnAddr 
Exceptions: 
TLB miss 


Page fault 
Bus error 


Instruction Class 
ORBIS32 I 
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Ljal Jump and Link L.jal 
310. JJ. Ie TP TP eas, Te Pe Te TPR Te TPR Te He Te FE Te TE TPR TE TTR TE TP Te Te TE Te TET 0 
opcode 0x1 N 
6 bits 26 bits 
Format: 
l.jal N 
Description: 


The immediate value is shifted left two bits, sign-extended to program counter width, and 
then added to the address of the jump instruction. The result is the effective address of the 
jump. The program unconditionally jumps to EA. If CPUCFGR[ND] is not set, the jump 
occurs with a delay of one instruction. The address of the instruction after the delay slot is 
placed in the link register r9 (see Register Usage on page 335). 


The value of the link register, if read as an operand in the delay slot will be the 
new value, not the old value. If the link register is written in the delay slot, the value 
written will replace the value stored by the 1. jal instruction. 


Note that l.sys should not be placed in the delay slot after a jump. 
32-bit Implementation: 


PC < exts(Immediate << 2) + JumpInsnAddr 
LR « CPUCFGR[ND] ? JumpInsnAddr + 4 : DelayInsnAddr + 4 


64-bit Implementation: 


PC - exts(Immediate << 2) + JumpInsnAddr 
LR « CPUCFGR[ND] ? JumpInsnAddr + 4 : DelayInsnAddr + 4 


Exceptions: 
TLB miss 


Page fault 
Bus error 


Instruction Class 
ORBIS32 I 
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L.jalr Jump and Link Register Ljalr 
31 Me | fl 2625 |i. fe df. J]. [fe Fle He 1615 Alte Me 1110 Jf. fe Tee Je Tf FP He 0 
opcode 0x12 reserved B reserved 
6 bits 10 bits 5 bits 11 bits 
Format: 
1l.jalr rB 
Description: 


The contents of general-purpose register rB is the effective address of the jump. The 
program unconditionally jumps to EA. If CPUCFGRI[ND] is not set, the jump occurs 
with a delay of one instruction. The address of the instruction after the delay slot is placed 
in the link register. 

It is not allowed to specify link register r9 (see Register Usage on page 335) as rB. 
This is because an exception in the delay slot (including external interrupts) may cause 
Ljalr to be reexecuted. 

The value of the link register, if read as an operand in the delay slot will be the 
new value, not the old value. If the link register is written in the delay slot, the value 
written will replace the value stored by the 1. jalr instruction. 


Note that l.sys should not be placed in the delay slot after a jump. 
32-bit Implementation: 


PC + rB 
LR « CPUCFGR[ND] ? JumpInsnAddr + 4 : DelayInsnAddr + 4 


64-bit Implementation: 


PC «- rB 
LR « CPUCFGR[ND] ? JumpInsnAddr + 4 : DelayInsnAddr + 4 


Exceptions: 
Alignment 
TLB miss 


Page fault 
Bus error 


Instruction Class 
ORBIS32 I 
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Ljr 


31 Wise |e NS 26 25 


opcode 0x11 
6 bits 


Format: 
l.jr rB 


Description: 


OpenRISC 1000 Architecture Manual 


Jump Register 


1615 


reserved 


10 bits 


5 bits 


1110 


September 7, 2017 


reserved 
11 bits 


The contents of general-purpose register rB is the effective address of the jump. The 
program unconditionally jumps to EA. If CPUCFGRI[ND] is not set, the jump occurs 
with a delay of one instruction. 


Note that |.sys should not be placed in the delay slot after a jump. 


32-bit Implementation: 


PC - rB 


64-bit Implementation: 


PC ~- rB 

Exceptions: 
Alignment 
TLB miss 


Page fault 
Bus error 
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L.Ibs Load Byte and Extend with Sign ].Ibs 
31 else | oe ii 2625 elie |e 2120 ~ {te |e W615 of}. fi. ye ye Te Te Tye Te He Te Te Te Te 0 
opcode 0x24 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1l.lbs rD,I(rA) 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The byte in memory addressed by EA is loaded into 
the low-order eight bits of general-purpose register rD. High-order bits of general- 
purpose register rD are replaced with bit 7 of the loaded value. 

32-bit Implementation: 

EA -— exts(Immediate) + rA[31:0] 

rD[7:0] < (EA)[7:0] 

rD[31:8] < (EA)[7] 

64-bit Implementation: 

EA - exts(Immediate) + rA[63:0] 

rD[7:0] < (EA)[7:0] 

rD[63:8] < (EA)[7] 

Exceptions: 


TLB miss 
Page fault 
Bus error 


Instruction Class 
ORBIS32 I 
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L.ibz Load Byte and Extend with Zero L.Ibz 
31 eo Wl] oe ie 2625 elie |e 2120 eft. |e W615 ff. fi. ye Hye Tye Te Tye Te He Te Te Pe Te 0 
opcode 0x23 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.loz rD,I(rA) 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The byte in memory addressed by EA is loaded into 
the low-order eight bits of general-purpose register rD. High-order bits of general- 
purpose register rD are replaced with zero. 

32-bit Implementation: 

EA -— exts(Immediate) + rA[31:0] 

rD[7:0] < (EA)[7:0] 

rD[31:8] <« 0 

64-bit Implementation: 

EA - exts(Immediate) + rA[63:0] 

rD[7:0] < (EA)[7:0] 

rD[63:8] « 0 

Exceptions: 

TLB miss 


Page fault 
Bus error 


Instruction Class 
ORBIS32 I 
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l.ld 


31 zlffce |] Me 26 25 
opcode 0x20 
6 bits 


Format: 
1.l1ld rD,I(rA) 


Description: 


OpenRISC 1000 Architecture Manual 


Load Double Word 
2120 effete 1615 fi. fj. |i. il. di). 
D A 
5 bits 5 bits 


September 7, 2017 


Lid 


16 bits 


The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The double word in memory addressed by EA is 
loaded into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


EA - exts(Immediate) + rA[63:0] 
rD[63:0] < (EA)[63:0] 


Exceptions: 


TLB miss 
Page fault 
Bus error 
Alignment 
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l.Ihs Load Half Word and Extend with Sign __IL.lhs 


31 Ltt. Tp. Tp. 26 25 Talal e 2120 {fe |. wis ff. Hf eT Te Te Te Ree eT The TP 0 
opcode 0x26 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1l.lhs rD,I(rA) 
Description: 


The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The half word in memory addressed by EA is loaded 
into the low-order 16 bits of general-purpose register rD. High-order bits of general- 
purpose register rD are replaced with bit 15 of the loaded value. 


32-bit Implementation: 

EA - exts(Immediate) + rA[31:0] 
rD[15:0] < (EA)[15:0] 

rD[31:16] <— (EA)[15] 

64-bit Implementation: 

EA - exts(Immediate) + rA[63:0] 
rD[15:0] < (EA)[15:0] 

rD[63:16] <— (EA)[15] 

Exceptions: 


TLB miss 
Page fault 
Bus error 
Alignment 


Instruction Class 
ORBIS32 I 
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L.hz Load Half Word and Extend with Zero L.hz 


31 Me | fe Mie 26 25 |). |e 2120 |). fl. 1615 J. |). fe He Te Te Te Te He Te Te FP Te Te 0 
opcode 0x25 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.1lhz rD,I(rA) 
Description: 


The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The half word in memory addressed by EA is loaded 
into the low-order 16 bits of general-purpose register rD. High-order bits of general- 
purpose register rD are replaced with zero. 


32-bit Implementation: 

EA - exts(Immediate) + rA[31:0] 
rD[15:0] < (EA)[15:0] 

rD[31:16] < 0 

64-bit Implementation: 

EA - exts(Immediate) + rA[63:0] 
rD[15:0] < (EA)[15:0] 

rD[63:16] < 0 

Exceptions: 

TLB miss 

Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS32 I 
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l.lwa Load Single Word Atomic l.lwa 
31 edie fe de 26 25 |}. |e 2120 |e |e 1615 fe |). Ife Te Te Te Te Te He Te The Fe He 0) 
opcode 0x1b D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.lwa rD,I(rA) 
Description: 


The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The single word in memory addressed by EA is 
loaded into the low-order 32 bits of general-purpose register rD. High-order bits of 
general-purpose register rD are replaced with zero. 

An atomic reservation is placed on the address formed from EA. In case an MMU is 
enabled, the physical translation of EA is used. 


32-bit Implementation: 


EA - exts(Immediate) + rA[31:0] 
rD[31:0] — (EA)[31:0] 
atomic_reserve[to_phys(EA) ] «1 


64-bit Implementation: 


EA 

rD[31:0] 

rD[63:32] 
atomic_reserve[to_phys(EA) ] 


exts(Immediate) + rA[63:0] 
(EA) [31:0] 
0 


td et 


1 
Exceptions: 

TLB miss 

Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS32 II 
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l.lws Load Single Word and Extend with Sign _ L.lws 


31 Me | fe Me 26 25 |). |e 2120 Ji). |i. 1615 J. |e fe He Te Pe Ue Te He Te Te FP Te TY 0 
opcode 0x22 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.lws rD,I(rA) 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The single word in memory addressed by EA is 
lloaded into the low-order 32 bits of general-purpose register rD. High-order bits of 
general-purpose register rD are replaced with bit 31 of the loaded value. 


32-bit Implementation: 


EA - exts(Immediate) + rA[31:0] 
rD[31:0] < (EA)[31:0] 


64-bit Implementation: 

EA - exts(Immediate) + rA[63:0] 
rD[31:0] < (EA)[31:0] 

rD[63:32] <— (EA)[31] 

Exceptions: 

TLB miss 

Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS32 I 
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Llwz Load Single Word and Extend with Zero — IL.lwz 


31 Jit. Te Hp. 26 25 Talal e 2120 J {fe |. wis ff. Hf LT Te Te Tee Jee TP The TP 0 
opcode 0x21 D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.lwz rD,I(rA) 
Description: 


The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The single word in memory addressed by EA is 
loaded into the low-order 32 bits of general-purpose register rD. High-order bits of 
general-purpose register rD are replaced with zero. 


32-bit Implementation: 


EA -— exts(Immediate) + rA[31:0] 
rD[31:0] < (EA)[31:0] 


64-bit Implementation: 

EA - exts(Immediate) + rA[63:0] 
rD[31:0] < (EA)[31:0] 

rD[63:32] < 0 

Exceptions: 

TLB miss 

Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS32 I 
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l.mac Multiply and Accumulate Signed ].mac 
31. |/. |/. fe |} 2625 |i. fe def) 2020 fe fe tye] 165 fe tye fe J] aato ff Pe ye fe de 43 (Ils 0 
opcode 0x31 reserved A B reserved opcode 0x1 
6 bits 5 bits 5 bits 5 bits 7 bits 4 bits 
Format: 


1.mac rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the 64 bit result is added to the special-purpose registers MACHI 
and MACLO. All operands are treated as signed integers. 


The instruction will set the overflow flag if signed overflow is detecting during the 
addition stage. 


32-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] + 
rA[31:0] * rB[31:0] 

SR[OV] - Signed overflow during addition stage 

64-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] + 
rA[63:0] * rB[63:0] 

SR[OV] - Signed overflow during addition stage 


Exceptions: 


Range Exception onsigned overflow if SR[OVE] and AECR[OVMACADDE | 
are set. 


Instruction Class 
ORBIS32 II 
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Multiply Immediate and Accumulate 


].maci : ].maci 
Signed 
Blof. ef). Te TP 265. Te Tey} ajo. Tey} gelas fe Pe Pe Re RT. eT. TT TE TR TP TE TE 0 
opcode 0x13 reserved A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.maci rA,I 

Description: 

The immediate value and the contents of general-purpose register rA are multiplied, and 
the 64 bit result is added to the special-purpose registers MACHI and MACLO. All 


operands are treated as signed integers. 


The instruction will set the overflow flag if signed overflow is detecting during the 
addition stage. 


32-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] + 
rA[31:0] * exts(Immediate) 

SR[OV] - Signed overflow during addition stage 

64-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] + 
rA[63:0] * exts(Immediate) 

SR[OV] - Signed overflow during addition stage 

Exceptions: 


Range Exception on signed overflow if SR[OVE] and AECR[OVMACADDE | 
are set. 


Instruction Class 
ORBIS32 II 
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].macrc MAC Read and Clear ].macrc 
31 {fe the |. 26 25 Jf. I. 2120 Jd. wie |. Te fee ee Te Tee Te Le He Te Je TP 0 
opcode 0x6 D reserved opcode 0x10000 
6 bits 5 bits 4 bits 17 bits 
Format: 


1l.macre rD 
Description: 


Once all instructions in MAC pipeline are completed, the contents of MAC is placed into 
general-purpose register rD and MAC accumulator is cleared. 

The MAC pipeline also synchronizes with the instruction pipeline on any access to 
MACLO or MACHI SPRs, so that l.mfspr can be used to read MACHI before executing 
Lmacre. 


32-bit Implementation: 
synchronize-mac 


rD[31:0] 
MACLO[31:0], MACHI[31:0] 


t 


MACLO[31:0] 
0 


t 


64-bit Implementation: 


synchronize -mac 
rD[63:0] 
MACLO[31:0], MACHI[31:0] 


t 


t 


MACHI[31:0]MACLO[31:0] 
Q 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lmacu Multiply and Accumulate Unsigned — |.macu 


31 eile fie fe |] 2625 |}. fj. }]. |] 2u20  f). J). ff aes |). Jf. fe |] atta if. fe |} fe 43 2 NS 0 
opcode 0x31 reserved A B reserved opcode 0x3 
6 bits 5 bits 5 bits 5 bits 7 bits 4 bits 
Format: 


1.macu rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the 64 bit result is added to the special-purpose registers MACHI 
and MACLO. All operands are treated as unsigned integers. 


The instruction will set the overflow flag if unsigned overflow is detecting during the 
addition stage. 


32-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] + 
rA[31:0] * rB[31:0] 

SR[CY] - unsigned overflow during addition stage 

64-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] + 
rA[63:0] * rB[63:0] 

SR[CY] - unsigned overflow during addition stage 


Exceptions: 


Range Exception on unsigned overflow if SR[OVE] and AECR[CYMACADDE ] 
are set. 


Instruction Class 
ORBIS32 II 
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lLmfspr Move From Special-Purpose Register |.mfspr 


31 eM We hs 26 25 |). |e 2120 Jd. fl. 1615 fe |e fe epee Te Te Te Te eT Te 0) 
opcode 0x2d D A K 
6 bits 5 bits 5 bits 16 bits 
Format: 


1l.mfspr rD,rA,K 
Description: 


The contents of the special register, defined by contents of general-purpose rA logically 
ORed with immediate value, are moved into general-purpose register rD. 


32-bit Implementation: 

rD[31:0] < spr(rA OR Immediate) 
64-bit Implementation: 

rD[63:0] < spr(rA OR Immediate) 
Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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L.movhi Move Immediate High l.movhi 
31 Pala ea 2625 se | oe | Is 2120 me | ln 1716 TE Nic hae [ge | fae | ice ise Mle | Paes se | Hi Ue Mie | fn 0 
opcode 0x6 D reserved opcode 0x0 K 
6 bits 5 bits 4 bits 1 bit 16 bits 
Format: 


1.movhi rD,K 
Description: 


The 16-bit immediate value is zero-extended, shifted left by 16 bits, and placed into 
general-purpose register rD. 


32-bit Implementation: 

rD[31:0] < extz(Immediate) << 16 
64-bit Implementation: 

rD[63:0] < extz(Immediate) << 16 
Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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l.msb Multiply and Subtract Signed l.msb 
31. |/. |]. fe |} 2625 |i. fe def) 2020 |p. fe fy} 165 ff tye fe J] aato fe Pe ye fe de 43 (IIs 0 
opcode 0x31 reserved A B reserved opcode 0x2 
6 bits 5 bits 5 bits 5 bits 7 bits 4 bits 
Format: 
1.msb rA,rB 
Description: 


The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the 64 bit result is subtracted from the special-purpose registers 
MACHI and MACLO. Result of the subtraction is placed into MACHI and MACLO 
registers. All operands are treated as signed integers. 


The instruction will set the overflow flag if signed overflow is detecting during the 
subtraction stage. 


32-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] - 
rA[31:0] * rB[31:0] 

SR[OV] - signed overflow during subtraction stage 

64-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] - 
rA[63:0] * rB[63:0] 

SR[OV] - signed overflow during subtraction stage 

Exceptions: 


Range Exception onsigned overflow if SR[OVE] and AECR[OVMACADDE | 
are set. 


Instruction Class 
ORBIS32 II 
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l.msbu Multiply and Subtract Unsigned l.msbu 


31 eVlefie fe f] 2625 |}. fj. }]. |] 220 f). J). ff aeas |). jf. fe |] atta if. fe fe fe 43 2 NS 0 
opcode 0x31 reserved A B reserved opcode 0x4 
6 bits 5 bits 5 bits 5 bits 7 bits 4 bits 
Format: 


1.msbu rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the 64 bit result is subtracted from the special-purpose registers 
MACHI and MACLO. Result of the subtraction is placed into MACHI and MACLO 
registers. All operands are treated as unsigned integers. 


The instruction will set the overflow flag if unsigned overflow is detecting during the 
subtraction stage. 


32-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] - 
rA[31:0] * rB[31:0] 

SR[CY] - unsigned overflow during subtraction stage 

64-bit Implementation: 

MACHI[31:0]MACLO[31:0] — MACHI[31:0]MACLO[31:0] - 
rA[63:0] * rB[63:0] 

SR[CY] - unsigned overflow during subtraction stage 

Exceptions: 


Range Exception onsigned overflow if SR[OVE] and AECR[CYMACADDE | 
are set. 


Instruction Class 
ORBIS32 II 
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l.msync Memory Synchronization l.msync 
opcode 0x22000000 
32 bits 
Format: 
1.msync 
Description: 


Execution of the memory synchronization instruction results in completion of all 
load/store operations before the RISC core continues. 


32-bit Implementation: 
memory-synchronization 
64-bit Implementation: 
memory-synchronization 
Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lmtspr Move To Special-Purpose Register —_|.mtspr 


31 elle | Keeal 3 2625 ||. |i. ||. |) 2120 ||. Ff. |). 1615 ||. ||. ||. wjao |). fee fe fe eye Pe 0 
opcode 0x30 K A B K 
6 bits 5 bits 5 bits 5 bits 11 bits 
Format: 


l.mtspr rA,rB,K 
Description: 


The contents of general-purpose register rB are moved into the special register defined by 
contents of general-purpose register rA logically ORed with the immediate value. 


32-bit Implementation: 

spr(rA OR Immediate) - rB[31:0] 
64-bit Implementation: 

spr(rA OR Immediate) - rB[31:0] 
Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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].mul Multiply Signed L.mul 
31 if |). ef. |) 2625 Ji] hh|) 2120 |) |h JL || aes |i]. |. |) a1t0 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x3 reserved opcode 0x6 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 

Format: 


1.mul rD,rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the result is truncated to destination register width and placed into 
general-purpose register rD. Both operands are treated as signed integers. 

The instruction will set the overflow flag on signed overflow. 


32-bit Implementation: 


rD[31:0] < rA[31:0] * rB[31:0] 
SR[OV] - Signed overflow 


64-bit Implementation: 


rD[63:0] < rA[63:0] * rB[63:0] 
SR[OV] - Signed overflow 


Exceptions: 


Range Exception onsigned overflow if SR[OVE] and AECR[OVMULE] are set. 


Instruction Class 
ORBIS32 II 
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].muld Multiply Signed to Double ].muld 
31 if |) fei. |) 2625 Ji ih|) 2120 |) h JL || aes |i |i}. |) 1120 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 reserved A B reserved opcode 0x3 reserved opcode 0x7 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 

Format: 


1.muld rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the result is stored in the MACHI and MACLO registers. Both 
operands are treated as signed integers. 

The instruction will set the overflow flag on signed overflow. 

32-bit Implementation: 

MACHI[31:0]MACLO[31:0] <— rA[31:0] * rB[31:0] 

64-bit Implementation: 


MACHI[31:0]MACLO[31:0] <— rA[63:0] * rB[63:0] 
SR[OV] - Signed overflow 


Exceptions: 


Range Exception onsigned overflow if SR[OVE] and AECR[OVMULE] are set. 


Instruction Class 
ORBIS32 II 
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].muldu Multiply Unsigned to Double l.muldu 


31 if |). ie. |) 2625 Ji he |) 2120 |) |, J) || aes |i]. |. |) 110 9 a7 ||. ||. || 43 |). |]. 0 
opcode 0x38 reserved A B reserved opcode 0x3 reserved opcode Oxc 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 
Format: 


1.muldu rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the result is stored in the MACHI and MACLO registers. Both 
operands are treated as unsigned integers. 

The instruction will set the overflow flag on unsigned overflow. 

32-bit Implementation: 

MACHI[31:0]MACLO[31:0] <— rA[31:0] * rB[31:0] 

64-bit Implementation: 


MACHI[31:0]MACLO[31:0] <— rA[63:0] * rB[63:0] 
SR[CY] - unsigned overflow 


Exceptions: 


Range Exception on signed overflow if SR[OVE] and AECR[CYMULE] are set. 


Instruction Class 
ORBIS32 II 
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Lmuli Multiply Immediate Signed ].muli 
31 e fle ff. df. 2625 ~ {te df. 2120 ~ {te |. W615 jy. Jf. ye Te J fe Te Te Te Te Pe He Pe Pe TT 0 
opcode 0x2c D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.muli rD,rA,I 

Description: 

The immediate value and the contents of general-purpose register rA are multiplied, and 
the result is truncated to destination register width and placed into general-purpose 
register rD. 

The instruction will set the overflow flag on signed overflow. 


32-bit Implementation: 


rD[31:0] < rA[31:0] * exts(Immediate) 
SR[OV] - signed overflow 


64-bit Implementation: 


rD[63:0] < rA[63:0] * exts(Immediate) 
SR[OV] - signed overflow 


Exceptions: 


Range Exception onsigned overflow if SR[OVE] and AECR[OVMULE] are set. 


Instruction Class 
ORBIS32 II 
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Lmulu Multiply Unsigned ].mulu 
31 if |). fe. |) 2625 Ji i{h|) 2120 |) |b JL || aes |i]. |. |) at0 9 g7 |i. ||. |] 43 |f. |. 0 
opcode 0x38 D A B reserved opcode 0x3 reserved opcode Oxb 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits 4 bits 4 bits 

Format: 


1.mulu rD,rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are multiplied, and the result is truncated to destination register width and placed into 
general-purpose register rD. Both operands are treated as unsigned integers. 

The instruction will set the carry flag on unsigned overflow. 


32-bit Implementation: 


rD[31:0] <— rA[31:0] * rB[31:0] 
SR[CY] - carry (unsigned overflow) 


64-bit Implementation: 


rD[63:0] <— rA[63:0] * rB[63:0] 
SR[CY] - carry (unsigned overflow) 


Exceptions: 


Range Exception onunsigned overflow if SR[OVE] and AECR[CYMULE] are 
set. 


Instruction Class 
ORBIS32 II 
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l.nop No Operation l.nop 
31}. |e Pe He J} Te TP 24l2 J ye Pe Pe TP YP tes He Pe eT Te Te TE TE TR Te TTP. HE TE oO 
opcode 0x15 reserved K 
8 bits 8 bits 16 bits 
Format: 
1.nop K 
Description: 


This instruction does not do anything except that it takes at least one clock cycle to 
complete. It is often used to fill delay slot gaps. Immediate value can be used for 
simulation purposes. 


32-bit Implementation: 


64-bit Implementation: 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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l.or Or lor 
31 if |) fe Jf. |) 2625 Ji h{h|) 2120 |) kJ) || aes |i]. |. |) 1120 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x4 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits A bits A bits 
Format: 


l.or rD,rA,rB 

Description: 

The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical OR operation. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] OR rB[31:0] 

64-bit Implementation: 

rD[63:0] < rA[63:0] OR rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lori Or with Immediate Half Word Lori 
31 ns 2625 ef. fl. 2120 afi. dy. 1615 oT. Ue ype Te TP. TP. Ee Pe FP Te Te Pe TP 0 
opcode 0x2a D A K 
6 bits 5 bits 5 bits 16 bits 
Format: 


l.ori rD,rA,K 

Description: 

The immediate value is zero-extended and combined with the contents of general- 
purpose register rA in a bit-wise logical OR operation. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] OR extz(Immediate) 

64-bit Implementation: 

rD[63:0] < rA[63:0] OR extz(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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L.psync Pipeline Synchronization L.psync 
opcode 0x22800000 
32 bits 
Format: 
1.psync 
Description: 


Execution of pipeline synchronization instruction results in completion of all instructions 
that were fetched before |.psync instruction. Once all instructions are completed, 
instructions fetched after |.psync are flushed from the pipeline and fetched again. 

32-bit Implementation: 

pipeline-synchronization 

64-bit Implementation: 

pipeline-synchronization 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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L.rfe Return From Exception L.rfe 
3100. JJ. Ie TP TP ease Te Pe Te TP. Te TPR Te Te Te Ee TP TE TPR TE TTP TE TP Te Te TE Te TET 0 
opcode 0x9 reserved 
6 bits 26 bits 
Format: 
l.rfe 
Description: 


Execution of this instruction partially restores the state of the processor prior to the 
exception. This instruction does not have a delay slot. 


32-bit Implementation: 


PC <- EPCR 
SR «+ ESR 


64-bit Implementation: 


PC «- EPCR 
SR <«- ESR 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lror Rotate Right Lror 
31 if |). fe. |) 2625 Ji |i h |) 2120 |) |e JL || aes |i]. |. |) 120 9 “ls 65 43). ||. 0 
opcode 0x38 D A B reserved opcode 0x3 reserved opcode 0x8 
6 bits 5 bits 5 bits 5 bits 1 bit 4 bits 2 bits 4 bits 

Format: 


1l.ror rD,rA,rB 

Description: 

General-purpose register rB specifies the number of bit positions; the contents of general- 
purpose register rA are rotated right. The result is written into general-purpose register 
rD. 


32-bit Implementation: 


rD[31-rB[4:0]:0] < rA[31:rB[4:0]] 
rD[31:32-rB[4:0]] <— rA[rB[4:0]-1:0] 


64-bit Implementation: 


rD[63-rB[5:0]:0] < rA[63:rB[5:0] ] 
rD[63:64-rB[5:0]] < rA[rB[5:0]-1:0] 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lrori Rotate Right with Immediate Lrori 
31. Jef. i] f] 2625 |). fet). |] 2aja0 f/f. le |} aes fle iP fe fe fe HT 87 65 ||. |||. |||] O 
opcode 0x2e D A reserved opcode 0x3 L 
6 bits 5 bits 5 bits 8 bits 2 bits 6 bits 
Format: 


1l.rori rD,rA,L 

Description: 

The 6-bit immediate value specifies the number of bit positions; the contents of general- 
purpose register rA are rotated right. The result is written into general-purpose register 
rD. In 32-bit implementations bit 5 of immediate is ignored. 


32-bit Implementation: 


rD[31-L:0] < rA[34:L] 
rD[31:32-L] « rA[L-1:0] 


64-bit Implementation: 


rD[63-L:0] < rA[63:L] 
rD[63:64-L] < rA[L-1:0] 


Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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l.sb Store Byte l.sb 
31 ef. fT. fe 2625 - ile |i. 2120 Pan ee 1615 Pah ean 1170 fy. Jf. He Te Te Ve Te Te 0 
opcode 0x36 I A B I 
6 bits 5 bits 5 bits 5 bits 11 bits 
Format: 


1l.sb I(rA),rB 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The low-order 8 bits of general-purpose register rB 
are stored to memory location addressed by EA. 


32-bit Implementation: 


EA - exts(Immediate) + rA[31:0] 
(EA)[7:0] < rB[7:0] 


64-bit Implementation: 


EA - exts(Immediate) + rA[63:0] 
(EA)[7:0] < rB[7:0] 


Exceptions: 
TLB miss 


Page fault 
Bus error 


Instruction Class 
ORBIS32 I 


www.opencores.org 1.2-1 97 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


l.sd Store Double Word L.sd 
31 ee Wee [ec] He 2625 | pee life 2120 effe dye 1615 off. dy W110 Jf. Pe Jf. He J}. Te dye Pe Te 0 
opcode 0x34 I A B I 
6 bits 5 bits 5 bits 5 bits 11 bits 
Format: 


1.sd I(rA),rB 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The double word in general-purpose register rB is 
stored to memory location addressed by EA. 

32-bit Implementation: 

N/A 

64-bit Implementation: 


EA - exts(Immediate) + rA[63:0] 
(EA)[63:0] < rB[63:0] 


Exceptions: 
TLB miss 
Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS64 I 


www.opencores.org 1.2-1 98 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


L.sfeq Set Flag if Equal L.sfeq 
31 elhe de die Tf. J]. Te Je Uf 2120 Pall eae 1615 ee 1110 lade Mele Weed feed fade 0 
opcode 0x720 A B reserved 
11 bits 5 bits 5 bits 11 bits 
Format: 


1.sfeq rA,rB 
Description: 


The contents of general-purpose registers rA and rB are compared. If the contents are 
equal, the compare flag is set; otherwise the compare flag is cleared. 


32-bit Implementation: 

SR[F] < rA[31:0] == rB[31:0] 
64-bit Implementation: 

SR[F] < rA[63:0] == rB[63:0] 
Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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L.sfeqi Set Flag if Equal Immediate 


31 wit. Te Pe fe fe He Te TP 2120 meee 1615 
opcode 0x5e0 A 
11 bits 5 bits 


Format: 
1.sfeqi rA,I 


Description: 


September 7, 2017 


L.sfeqi 


The contents of general-purpose register rA and the sign-extended immediate value are 
compared. If the two values are equal, the compare flag is set; otherwise the compare flag 


is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] == exts(Immediate) 
64-bit Implementation: 

SR[F] < rA[63:0] == exts(Immediate) 
Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lsfges Set Flag if Greater or Equal Than Signed I.sfges 


31 elle fe dhe Te Te [Pe Je Tf 2120 Palle 1615 ee 1110 ld oe Mele Wee faced feed fe 0 
opcode 0x72b A B reserved 
11 bits 5 bits 5 bits 11 bits 


Format: 

1l.sfges rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as signed integers. If 
the contents of the first register are greater than or equal to the contents of the second 
register, the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] >= rB[31:0] 

64-bit Implementation: 

SR[F] < rA[63:0] >= rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Set Flag if Greater or Equal Than 


Lsfgesi . ; L.sfgesi 
8 Immediate Signed 8 
31 aM | Niae Mise tee Mee Mose ells 2120 . |. ff. 1615 eT Te TP. Te Te Te Te Te Te Te TE. Le TP 0 
opcode OxSeb A I 
11 bits 5 bits 16 bits 
Format: 


1.sfgesi rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as signed integers. If the contents of the first register are greater than or equal 
to the immediate value the compare flag is set; otherwise the compare flag is cleared. 
32-bit Implementation: 

SR[F] < rA[31:0] >= exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] >= exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Set Flag if Greater or Equal Than 


L.sfgeu : l.sfgeu 
8 Unsigned 8 
31 eThe Te qe Te Pe Pepe yf 2120 ee 1615 Jyh 1110 eT Te Te Te Te Te He 0 
opcode 0x723 A B reserved 
11 bits 5 bits 5 bits 11 bits 
Format: 


1.sfgeu rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as unsigned integers. If 
the contents of the first register are greater than or equal to the contents of the second 
register, the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] >= rB[31:0] 

64-bit Implementation: 

SR[F] <- rA[63:0] >= rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Set Flag if Greater or Equal Than 


Lsfgeui : : Lsfgeui 
8 Immediate Unsigned g 
31 eM | Nose Mise [ee Mae Mise] sells 2120 |. ff. 1615 eT Te TP. Te dpe Te Te He Te TP TE. TL TP 0 
opcode 0x5e3 A I 
11 bits 5 bits 16 bits 
Format: 


1.sfgeui rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as unsigned integers. If the contents of the first register are greater than or 
equal to the immediate value the compare flag is set; otherwise the compare flag is 
cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] >= exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] >= exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lsfgts Set Flag if Greater Than Signed L.sfgts 
31 elle Te dhe Te Te Te Je Tf 2120 Palle 1615 ee 1110 le Mele Weed faced fee} fe 0 
opcode 0x72a A B reserved 
11 bits 5 bits 5 bits 11 bits 
Format: 


l.sfgts rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as signed integers. If 
the contents of the first register are greater than the contents of the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] <- rA[31:0] > rB[31:0] 

64-bit Implementation: 

SR[F] < rA[63:0] > rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Set Flag if Greater Than Immediate 


L.sfgtsi : | .sfgtsi 
8 Signed 6 
31 edt Pe Pe The Te He Te Tp. 2120 J {fe tf. 1615 eT Te TP Te dpe Te Te TT Te The He TP 0 
opcode Ox5ea A I 
11 bits 5 bits 16 bits 
Format: 


1l.sfgtsi rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as signed integers. If the contents of the first register are greater than the 
immediate value the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] > exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] > exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lsfgtu Set Flag if Greater Than Unsigned Lsfgtu 


31 elhe Te dhe Tf. Te Te Je Uf 2120 Palle 1615 ee 1110 eve de Te Te Te Te Tf. He 0 
opcode 0x722 A B reserved 
11 bits 5 bits 5 bits 11 bits 


Format: 

1l.sfgtu rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as unsigned integers. If 
the contents of the first register are greater than the contents of the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] <- rA[31:0] > rB[31:0] 

64-bit Implementation: 

SR[F] < rA[63:0] > rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Set Flag if Greater Than Immediate 


Lsfgtui : L.sfgtui 
8 Unsigned 8 
31 eM Nise Mase tee Me Mts. [ses 2120 . |. ff. 1615 eT Te TP. Te Te Te Te He Te TP TP HL TP 0 
opcode 0x5e2 A I 
11 bits 5 bits 16 bits 
Format: 


1l.sfgtui rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as unsigned integers. If the contents of the first register are greater than the 
immediate value the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] > exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] > exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lsfles Set Flag if Less or Equal Than Signed __l.sfles 


31 elhe Te dhe Te Te Te Je Uf 2120 allem 1615 ee 1110 fede Mele Weed faced fae} fe 0 
opcode 0x72d A B reserved 
11 bits 5 bits 5 bits 11 bits 


Format: 

1l.sfles rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as signed integers. If 
the contents of the first register are less than or equal to the contents of the second 
register, the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] <= rB[31:0] 

64-bit Implementation: 

SR[F] <- rA[63:0] <= rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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que Set Flag if Less or Equal Than Immediate 1 


i Signed .Sflesi 


31 ede Pe Pe dhe Te He Te Tp. 2120 J {fe tf. 1615 eT Te TP Te dpe Te Te TT Te The He TP 0 
opcode OxSed A I 
11 bits 5 bits 16 bits 


Format: 

1.sflesi rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as signed integers. If the contents of the first register are less than or equal to 
the immediate value the compare flag is set; otherwise the compare flag is cleared. 
32-bit Implementation: 

SR[F] < rA[31:0] <= exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] <= exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 


www.opencores.org 1.2-1 110 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


l.sfleu Set Flag if Less or Equal Than Unsigned _|.sfleu 


31 elle Te fe Tf. Te Te Je Tf 2120 Palle 1615 ee 1110 ldo Mele Weed eccd feed fe 0 
opcode 0x725 A B reserved 
11 bits 5 bits 5 bits 11 bits 


Format: 

1.sfleu rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as unsigned integers. If 
the contents of the first register are less than or equal to the contents of the second 
register, the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] <= rB[31:0] 

64-bit Implementation: 

SR[F] <- rA[63:0] <= rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lsfleui Set Flag if Less or Equal Than Immediate 1 


; Sfleui 
Unsigned 
31 eeMe | Nie Mise ee Mie Mt | sells 2120 . |. ff. 1615 eT Te TP. Te dpe Te Te He Te TP TP Le TP 0 
opcode Ox5e5 A I 
11 bits 5 bits 16 bits 


Format: 

1.sfleui rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as unsigned integers. If the contents of the first register are less than or equal to 
the immediate value the compare flag is set; otherwise the compare flag is cleared. 
32-bit Implementation: 

SR[F] < rA[31:0] <= exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] <= exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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L.sflts Set Flag if Less Than Signed Lsflts 
31 elhe TTe die Tf. I]. Te Je Tf 2120 allele 1615 Jie df 1110 ldo Mele Whee faced feed fe 0 
opcode 0x72c A B reserved 
11 bits 5 bits 5 bits 11 bits 
Format: 


1l.sflts rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as signed integers. If 
the contents of the first register are less than the contents of the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] < rB[31:0] 

64-bit Implementation: 

SR[F] < rA[63:0] < rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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L.sfltsi Set Flag if Less Than Immediate Signed _|.sfltsi 


31 edt Tbe Pe Te Te He Te Te 2120 fc} fells 1615 eT Te Te Te Te Te Te Te Te TP TP. He TP 0 
opcode Ox5ec A I 
11 bits 5 bits 16 bits 


Format: 

1.sfltsi rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as signed integers. If the contents of the first register are less than the 
immediate value the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] < exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] < exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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Lsfltu Set Flag if Less Than Unsigned Lsfltu 
31 elle Te dhe [fe Te Te Je Uf 2120 Pall eae 1615 Jie d 1110 ele de Te Te Te Te Tf. He 0 
opcode 0x724 A B reserved 
11 bits 5 bits 5 bits 11 bits 
Format: 


1l.sfltu rA,rB 

Description: 

The contents of general-purpose registers rA and rB are compared as unsigned integers. If 
the contents of the first register are less than the contents of the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] < rB[31:0] 

64-bit Implementation: 

SR[F] < rA[63:0] < rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lsfltui Set Flag if Less Than Immediate Unsigned L.sfltui 


31 edt dye Te fe Te He Te Te 2120 fc} fells 1615 eT Te TT. Te Te Te Te He Te TP TP. Ee TP 0 
opcode 0x5e4 A I 
11 bits 5 bits 16 bits 


Format: 

1.sfltui rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared as unsigned integers. If the contents of the first register are less than the 
immediate value the compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] < exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] < exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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l.sfne Set Flag if Not Equal l.sfne 
31 elhe TTe Tle [fe I]. Te Je Tf 2120 ral eae 1615 ee 1110 le Melle Weed faced fae} fe 0 
opcode 0x721 A B reserved 
11 bits 5 bits 5 bits 11 bits 
Format: 


1l.sfne rA,rB 
Description: 


The contents of general-purpose registers rA and rB are compared. If the contents are not 
equal, the compare flag is set; otherwise the compare flag is cleared. 


32-bit Implementation: 

SR[F] < rA[31:0] != rB[31:0] 
64-bit Implementation: 

SR[F] < rA[63:0] != rB[63:0] 
Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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L.sfnei Set Flag if Not Equal Immediate L.sfnei 
31 edfe die Pe fe Te He Te Te 2120 fc} feel lts 1615 eT Te Te Te Te Te Te He Te Pe Te Ee TP 0 
opcode 0x5e1 A I 
11 bits 5 bits 16 bits 
Format: 


1l.sfnei rA,I 

Description: 

The contents of general-purpose register rA and the sign-extended immediate value are 
compared. If the two values are not equal, the compare flag is set; otherwise the compare 
flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] != exts(Immediate) 

64-bit Implementation: 

SR[F] < rA[63:0] != exts(Immediate) 

Exceptions: 


None 


Instruction Class 
ORBIS32 II 
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I.sh Store Half Word I.sh 
31 oe Wee |] 4] He 2625 | pee life 2120 effe dye 1615 off. de W110 J. Te Jf. He J]. Te J fe Pe de 0 
opcode 0x37 I A B I 
6 bits 5 bits 5 bits 5 bits 11 bits 
Format: 


1.sh I(rA),rB 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The low-order 16 bits of general-purpose register rB 
are stored to memory location addressed by EA. 


32-bit Implementation: 


EA - exts(Immediate) + rA[31:0] 
(EA)[15:0] < rB[15:0] 


64-bit Implementation: 


EA - exts(Immediate) + rA[63:0] 
(EA)[15:0] < rB[15:0] 


Exceptions: 
TLB miss 
Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS32 I 
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Lsll Shift Left Logical Lsll 
31 if |). Hef. |) 2625 Ji i |e |) 2120 |) |b J, || aes |i]. |. |) 1120 9 “lls 65 43). ||. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x8 
6 bits 5 bits 5 bits 5 bits 1 bit 4 bits 2 bits 4 bits 

Format: 


1.sll rD,rA,rB 

Description: 

General-purpose register rB specifies the number of bit positions; the contents of general- 
purpose register rA are shifted left, inserting zeros into the low-order bits. The result is 
written into general-purpose rD. In 32-bit implementations bit 5 of rB is ignored. 


32-bit Implementation: 


rD[31:rB[4:0]] < rA[31-rB[4:0]:0] 
rD[rB[4:0]-1:0] <0 


64-bit Implementation: 


rD[63:rB[5:0]] < rA[63-rB[5:0]:0] 
rD[rB[5:0]-1:0] - 0 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lslli Shift Left Logical with Immediate Lslli 
31. fe fe |] f] 2625 |). ef). |] 2aja0 f/f. le |] aes fle if fe fe fe HT 87 65 ||. |||. |||] O 
opcode 0x2e D A reserved opcode 0x0 L 
6 bits 5 bits 5 bits 8 bits 2 bits 6 bits 
Format: 


1.slli rD,rA,L 

Description: 

The immediate value specifies the number of bit positions; the contents of general- 
purpose register rA are shifted left, inserting zeros into the low-order bits. The result is 
written into general-purpose register rD. In 32-bit implementations bit 5 of immediate is 
ignored. 


32-bit Implementation: 


rD[31:L] < rA[31-L:0] 
rD[L-1:0] <- 0 


64-bit Implementation: 


rD[63:L] < rA[63-L:0] 
rD[L-1:0] < 0 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 


www.opencores.org 1.2-1 121 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Lsra Shift Right Arithmetic Lsra 
31 if |) fei. |) 2625 Ji ih |) 2120 |) h JL || aes |i]. |. |) 1120 9 “lls 65 43). ||. 0 
opcode 0x38 D A B reserved opcode 0x2 reserved opcode 0x8 
6 bits 5 bits 5 bits 5 bits 1 bit 4 bits 2 bits 4 bits 

Format: 


1.sra rD,rA,rB 

Description: 

General-purpose register rB specifies the number of bit positions; the contents of general- 
purpose register rA are shifted right, sign-extending the high-order bits. The result is 
written into general-purpose register rD. In 32-bit implementations bit 5 of rB is ignored. 


32-bit Implementation: 


rD[31-rB[4:0]:0] < rA[31:rB[4:0]] 
rD[31:32-rB[4:0]] <— rA[31] 


64-bit Implementation: 


rD[63-rB[5:0]:0] < rA[63:rB[5:0] ] 
rD[63:64-rB[5:0]] < rA[63] 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lsrai Shift Right Arithmetic with Immediate _ Lsrai 


31. Jef. |] f] 2625 J). |e). |] 2aj20 f/f. fe |} aes fle if fe Pe Ife HT 87 65 ||. |||. |||] O 
opcode 0x2e D A reserved opcode 0x2 L 
6 bits 5 bits 5 bits 8 bits 2 bits 6 bits 
Format: 


1.srai rD,rA,L 

Description: 

The 6-bit immediate value specifies the number of bit positions; the contents of general- 
purpose register rA are shifted right, sign-extending the high-order bits. The result is 
written into general-purpose register rD. In 32-bit implementations bit 5 of immediate is 
ignored. 


32-bit Implementation: 


rD[31-L:0] < rA[31:L] 
rD[31:32-L] < rA[34] 


64-bit Implementation: 


rD[63-L:0] < rA[63:L] 
rD[63:64-L] < rA[63] 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lsrl Shift Right Logical Lsrl 
31 if |). fei. |) 2625 Ji hh |) 2120 |) /h JL || aes |i]. |. |) 1120 9 “ls 65 43). ||. 0 
opcode 0x38 D A B reserved opcode 0x1 reserved opcode 0x8 
6 bits 5 bits 5 bits 5 bits 1 bit 4 bits 2 bits 4 bits 

Format: 


1.srl rD,rA,rB 

Description: 

General-purpose register rB specifies the number of bit positions; the contents of general- 
purpose register rA are shifted right, inserting zeros into the high-order bits. The result is 
written into general-purpose register rD. In 32-bit implementations bit 5 of rB is ignored. 


32-bit Implementation: 


rD[31-rB[4:0]:0] < rA[31:rB[4:0]] 
rD[31:32-rB[4:0]] < 0 


64-bit Implementation: 


rD[63-rB[5:0]:0] < rA[63:rB[5:0] ] 
rD[63:64-rB[5:0]] < 0 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lsrli Shift Right Logical with Immediate Lsrli 
31. fe |f- |]. f] 2625 |). |e). |] 2aj20 f/f. le} aes fle if. fe fe fe HT 87 65 ||. |||.) || O 
opcode 0x2e D A reserved opcode 0x1 L 
6 bits 5 bits 5 bits 8 bits 2 bits 6 bits 
Format: 


1.srli rD,rA,L 

Description: 

The 6-bit immediate value specifies the number of bit positions; the contents of general- 
purpose register rA are shifted right, inserting zeros into the high-order bits. The result is 
written into general-purpose register rD. In 32-bit implementations bit 5 of immediate is 
ignored. 


32-bit Implementation: 


rD[31-L:0] < rA[31:L] 
rD[31:32-L] - 0 


64-bit Implementation: 


rD[63-L:0] < rA[63:L] 
rD[63:64-L] < 0 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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l.sub Subtract l.sub 
31 if |) Hef. |) 2625 Ji hh |) 2120 |) kJ) || aes |i]. |. |) 120 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x2 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits A bits A bits 
Format: 


1.sub rD,rA,rB 

Description: 

The contents of general-purpose register rB are subtracted from the contents of general- 
purpose register rA to form the result. The result is placed into general-purpose register 


rD. 


The instruction will set the carry flag on unsigned overflow, and the overflow flag on 
signed overflow. 


32-bit Implementation: 

rD[31:0] < rA[31:0] - rB[31:0] 
SR[CY] - carry (unsigned overflow) 
SR[OV] - signed overflow 

64-bit Implementation: 

rD[63:0] < rA[63:0] - rB[63:0] 

SR[CY ] - carry (unsigned overflow) 
SR[OV] - Signed overflow 
Exceptions: 


Range Exception onoverflow if SR[OVE] and AECR[OVADDE] are set. 
Range Exception oncarryif SR[OVE] and AECR[CYADDE] are set. 


Instruction Class 
ORBIS32 I 
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L.sw Store Single Word l.sw 
31 ef. fT. df. 2625 - ile |i. 2120 Pan ee 1615 Pah ean 1120 fy. Jf. ye Te Te Ve Te Te 0 
opcode 0x35 I A B I 
6 bits 5 bits 5 bits 5 bits 11 bits 
Format: 


1.sw I(rA),rB 

Description: 

The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The low-order 32 bits of general-purpose register rB 
are stored to memory location addressed by EA. 


32-bit Implementation: 


EA - exts(Immediate) + rA[31:0] 
(EA)[31:0] < rB[31:0] 


64-bit Implementation: 


EA - exts(Immediate) + rA[63:0] 
(EA)[31:0] < rB[31:0] 


Exceptions: 
TLB miss 
Page fault 


Bus error 
Alignment 


Instruction Class 
ORBIS32 I 
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l.swa Store Single Word Atomic l.swa 
31 oie fT. fe 2625 - ile |i. 2120 1615 Pa ean 1170 fy. Jf. He Te de Ve Te Te 0 
opcode 0x33 I A B I 
6 bits 5 bits 5 bits 5 bits 11 bits 
Format: 


1.swa I(rA),rB 


Description: 


The offset is sign-extended and added to the contents of general-purpose register rA. The 
sum represents an effective address. The low-order 32 bits of general-purpose register rB 
are conditionally stored to memory location addressed by EA. The ‘atomic’ condition 
relies on that an atomic reserve to EA is still intact. When the MMU is enabled, the 
physical translation of EA is used to do the address comparison. 


32-bit Implementation: 


EA 
if (atomic) (EA)[31:0] 
SR[F ] 


64-bit Implementation: 

EA 

if (atomic) (EA)[31:0] 
SR[F] 

Exceptions: 

TLB miss 

Page fault 


Bus error 
Alignment 


www.opencores.org 





- exts(Immediate) + rA[31:0] 
rB[31:0] 
- atomic 


t 


- exts(Immediate) + rA[63:0] 
- rB[31:0] 
- atomic 


Instruction Class 
ORBIS32 II 


1.2-1 128 of 357 


OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Lsys System Call Lsys 
B10 Tf. Te Te Te Ye Le Te TP Te Te Te TP Pe TT TP 6S HP. Ye TT He TP Te Te TE Te Te TPR Te TE TE 0 
opcode 0x2000 K 
16 bits 16 bits 
Format: 
l.sys K 
Description: 


Execution of the system call instruction results in the system call exception. The system 
calls exception is a request to the operating system to provide operating system services. 
The immediate value can be used to specify which system service is requested, 
alternatively a GPR defined by the ABI can be used to specify system service. 

Because an l|.sys causes an intentional exception, rather than an interruption of normal 
processing, the matching I.rfe returns to the next instruction. As this is considered to be 
the jump itself for exceptions occurring in a delay slot, l.sys should not be placed in a 
delay slot. 

32-bit Implementation: 

system-call-exception(k) 

64-bit Implementation: 

system-call-exception(k) 

Exceptions: 


System Call 


Instruction Class 
ORBIS32 I 
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l.trap Trap l.trap 
B10 Tf. Te Te Te Ye Le Te TP Te TE Te TP, Pe TT TE 6S TP. Ye TP Te He TP Te Te TE. TP Te Te Te TE TE 0 
opcode 0x2100 K 
16 bits 16 bits 
Format: 
1.trap K 
Description: 


Trap exception is a request to the operating system or to the debug facility to execute 
certain debug services. Immediate value is used to select which SR bit is tested by trap 
instruction. 

32-bit Implementation: 

trap-exception( ) 

64-bit Implementation: 

trap-exception( ) 


Exceptions: 


Trap exception 


Instruction Class 
ORBIS32 II 
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L.xor Exclusive Or L.xor 
31 if |) Hef. |) 2625 Ji hh |) 2120 |) kJ) || aes |i]. |. |) 120 9 g7 |i. ||. |] 43 If. |. 0 
opcode 0x38 D A B reserved opcode 0x0 reserved opcode 0x5 
6 bits 5 bits 5 bits 5 bits 1 bit 2 bits A bits A bits 

Format: 


1.xor rD,rA,rB 

Description: 

The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical XOR operation. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] XOR rB[31:0] 

64-bit Implementation: 

rD[63:0] < rA[63:0] XOR rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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Lxori Exclusive Or with Immediate Half Word _l|.xori 


31 Jit. Te Tp. 26 25 Talal e 2120 {fe |. wis ff. Hf LT Te Te Te Ree Ye TP TP TP 0 
opcode 0x2b D A I 
6 bits 5 bits 5 bits 16 bits 
Format: 


1.xori rD,rA,I 

Description: 

The immediate value is sign-extended and combined with the contents of general-purpose 
register rA in a bit-wise logical XOR operation. The result is placed into general-purpose 
register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] XOR exts(Immediate) 

64-bit Implementation: 

rD[63:0] < rA[63:0] XOR exts(Immediate) 


Exceptions: 


None 


Instruction Class 
ORBIS32 I 
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If.add.d Add Floating-Point Double-Precision §If.add.d 


31 wife le |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || teas |). |). |}. |] 1/10 . iy |i. \f. lf. lf. fe | 0 
opcode 0x32 D A B reserved opcode 0x10 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


5.4 ORFPX32/64 


Format: 
lf.add.d rD,rA,rB 
Description: 


The contents of general-purpose register rA are added to the contents of general-purpose 
register rB to form the result. The result is placed into general-purpose register rD. 


32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] < rA[63:0] + rB[63:0] 
Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 I 
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If.add.s Add Floating-Point Single-Precision — If.add.s 


31 wife le |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || teas |). |). |]. |] 1/10 . Bi7 |i. \f. lf. fe fe | 0 
opcode 0x32 D A B reserved opcode 0x0 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.add.s rD,rA,rB 
Description: 


The contents of general-purpose register rA are added to the contents of general-purpose 
register rB to form the result. The result is placed into general-purpose register rD. 


32-bit Implementation: 
rD[31:0] < rA[31:0] + rB[31:0] 
64-bit Implementation: 


rD[31:0] < rA[31:0] + rB[31:0] 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 I 
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Reserved for ORFPX64 Custom 


If.cust1.d ‘ If.cust1.d 
Instructions 
31. |i. |). |]. |) 2625 Jf. |. fj.) 2a20 |). |} 1645 |). ]).]).]) atj10 || 8i7 |. 43 |. |). 0 
opcode 0x32 reserved A B reserved opcode Oxe reserved 
6 bits 5 bits 5 bits 5 bits 3 bits A bits A bits 
Format: 


lf.custi.d rA,rB 

Description: 

This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but instead by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORFPX64 II 
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Reserved for ORFPX32 Custom 


If.cust1.s ‘ If.cust1.s 
Instructions 
31]. |i. |f. ||. |] 2625 | f/f | 2120 |.) 4) 1645 |). |||). |) 1}10 . 87 . |i. 43 ||. ||. 0 
opcode 0x32 reserved A B reserved opcode Oxd reserved 
6 bits 5 bits 5 bits 5 bits 3 bits 4 bits A bits 
Format: 


1lf.custi.s rA,rB 

Description: 

This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but instead by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORFPX32 II 
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If.div.d Divide Floating-Point Double-Precision If.div.d 


31 wife le |e 2625 ||. ||. ||. || 21/20  |/. ||. ||. || teas |). |). |}. |] 1}10 . Biz |i. |}. lf. fe fe | 0 
opcode 0x32 D A B reserved opcode 0x13 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.div.d rD,rA,rB 

Description: 

The contents of general-purpose register rA are divided by the contents of general- 
oo register rB to form the result. The result is placed into general-purpose register 
32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] < rA[63:0] / rB[63:0] 

Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 II 
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If.div.s Divide Floating-Point Single-Precision _ If.div.s 


31 wife Hye |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || 165 |). |). |]. |} 1/10 . Bi7 |i. \f. lf. fe fe | 0 
opcode 0x32 D A B reserved opcode 0x3 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.div.s rD,rA,rB 

Description: 

The contents of general-purpose register rA are divided by the contents of general- 
oo register rB to form the result. The result is placed into general-purpose register 
32-bit Implementation: 

rD[31:0] < rA[31:0] / rB[31:0] 

64-bit Implementation: 


rD[31:0] < rA[31:0] / rB[31:0] 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 II 
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If. ftoi.d Floating-Point Double-Precision To 


If.ftoi.d 
Integer 
31 elie ]ie ff. |] 2625 ff. ff. ]f. |} 2a20 |). Jf. |. ]f aes ied | 1110 . BI7 ||. Jf. |. ||. fle |] 0 
opcode 0x32 D A opcode 0x0 reserved opcode 0x15 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
1lf.ftoi.d rD,rA 
Description: 


The contents of general-purpose register rA are converted to an integer and stored in 
general-purpose register rD. 


32-bit Implementation: 

N/A 

64-bit Implementation: 
rD[63:0] <— ftoi(rA[63:0]) 
Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 I 
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Floating-Point Single-Precision To 


If.ftoi.s If.ftoi.s 
Integer 
31. |. |]. |f- |} 2625) |. |i. |). |] 2/20 |. |). |). |] 16i45 ied | 1110 . BI7 ||. ||. |. ||. fle |] 0 
opcode 0x32 D A opcode 0x0 reserved opcode 0x5 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.ftoi.s rD,rA 
Description: 


The contents of general-purpose register rA are converted to an integer and stored into 
general-purpose register rD. 


32-bit Implementation: 
rD[31:0] < ftoi(rA[31:0]) 
64-bit Implementation: 


rD[31:0] < ftoi(rA[31:0]) 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 I 
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If.itof.d Integer To Floating-Point Double- 


e e If.itof.d 
Precision 
31 elie ]ie ff. |] 2625 ff. ff. ]f. |} 2a20 |). Jf. |. ]f aes eae 1110 . BI7 ||. ||. |. ||. fle |] 0 
opcode 0x32 D A opcode 0x0 reserved opcode 0x14 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lf.itof.d rD,rA 
Description: 


The contents of general-purpose register rA are converted to a double-precision floating- 
point number and stored in general-purpose register rD. 


32-bit Implementation: 

N/A 

64-bit Implementation: 
rD[63:0] < itof(rA[63:0]) 
Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 I 
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Integer To Floating-Point Single- 


If.itof.s ie If.itof.s 
Precision 
31 elie ]ie ff. |] 2625 ff. ff. ]f. |} 2a20 |). Jf. |. ]f aes eae 1110 . BI7 ||. ||. ||. ||. fle |] 0 
opcode 0x32 D A opcode 0x0 reserved opcode 0x4 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.itof.s rD,rA 
Description: 


The contents of general-purpose register rA are converted to a single-precision floating- 
point number and stored into general-purpose register rD. 


32-bit Implementation: 
rD[31:0] < itof(rA[31:0]) 
64-bit Implementation: 


rD[31:0] < itof(rA[31:0]) 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 I 
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Multiply and Add Floating-Point 


If.madd.d abe If.madd.d 
Double-Precision 
31 {fee Ue 2625 |{. ||. ||. || 2120 Jf. |]. /).]} 1645 |). |. \. |} aio ; gi7_ ||. J). |). |). |]. Jf. 0 
opcode 0x32 D A B reserved opcode 0x17 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.madd.d rD,rA, rB 
Description: 


The contents of general-purpose register rA are multiplied by the contents of general- 
purpose register rB, and added to special-purpose register FRMADDLO/FPMADDHI. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


FPMADDHI[31:0]FPMADDLO[31:0] — rA[63:0] * rB[63:0] + 
FPMADDHI[31:0]FPMADDLO[31:0] 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 II 
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if add « Multiply and Add Floating-Point 


: ma If.madd.s 
Single-Precision 
31 fie fe {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. Pee /f telis f/f). fl. f} ta fto . Bi7 |i. \f. |. lf. fe |e 0 
opcode 0x32 D A B reserved opcode 0x7 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.madd.s rD,rA,rB 
Description: 


The contents of general-purpose register rA are multiplied by the contents of general- 
purpose register rB, and added to special-purpose register FPMADDLO/FPMADDHI. 


32-bit Implementation: 


FPMADDHI[31:0]FPMADDLO[31:0] — rA[31:0] * rB[31:0] + 
FPMADDHI[31:0]FPMADDLO[31:0] 


64-bit Implementation: 

FPMADDHI[31:0]FPMADDLO[31:0] < rA[31:0] * rB[31:0] + 
FPMADDHI[31:0]FPMADDLO[31:0] 

FPMADDHI < 0 

FPMADDLO < 0 

Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 II 
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Multiply Floating-Point Double- 


If.mul.d =, If.mul.d 
Precision 
31 {fee Ue 2625 |{. ||. ||. || 2120 Jf. |}. /).]} 1645 |i. |. {{. |} aio ; gi7_ ||. J). |). |). |. Jf. 0 
opcode 0x32 D A B reserved opcode 0x12 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.mul.d rD,rA, rB 

Description: 

The contents of general-purpose register rA are multiplied by the contents of general- 
ae register rB to form the result. The result is placed into general-purpose register 
32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] « rA[63:0] * rB[63:0] 

Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 I 
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If.mul.s Multiply Floating-Point Single-Precision If.mul.s 


31 wf He |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || 165 |). |). |]. |] 1/10 . Biz |i. \f. lf. fe fe | 0 
opcode 0x32 D A B reserved opcode 0x2 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1f.mul.s rD,rA,rB 

Description: 

The contents of general-purpose register rA are multiplied by the contents of general- 
a register rB to form the result. The result is placed into general-purpose register 
32-bit Implementation: 

rD[31:0] < rA[31:0] * rB[31:0] 

64-bit Implementation: 


rD[31:0] < rA[31:0] * rB[31:0] 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 I 
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Remainder Floating-Point Double- 


If.rem.d ae: If.rem.d 
Precision 
31 {fee Ue 2625 |{. ||. ||. || 2120 Jf. |}. /).]} 1645 |i. |. {{. |} aio ; gi7_ ||. J). |). |). |. Jf. 0 
opcode 0x32 D A B reserved opcode 0x16 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.rem.d rD,rA,rB 

Description: 

The contents of general-purpose register rA are divided by the contents of general- 
purpose register rB, and remainder is used as the result. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] « rA[63:0] % rB[63:0] 

Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 II 
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Remainder Floating-Point Single- 


If.rem.s ae If.rem.s 
Precision 
31 {fee Ue 2625 |{. ||. ||. || 2120 Jf. |}. /).]} 1645 |i. |. {{. |} aio ; 87 ||. J). |). |). |. Jf. 0 
opcode 0x32 D A B reserved opcode 0x6 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.rem.s rD,rA,rB 

Description: 

The contents of general-purpose register rA are divided by the contents of general- 
purpose register rB, and remainder is used as the result. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

rD[31:0] < rA[31:0] % rB[31:0] 

64-bit Implementation: 


rD[31:0] < rA[31:0] % rB[31:0] 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 II 
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Set Flag if Equal Floating-Point 


If.sfeq.d a If.sfeq.d 
q Double-Precision q 
31 elie ff. {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. Pee} telas f/f). fl. f} afte . Bi7 |i. \f. lf. |e fe | 0 
opcode 0x32 reserved A B reserved opcode 0x18 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sfeq.d rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the two registers are equal, the compare flag is set; otherwise the 
compare flag is cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

SR[F] <- rA[63:0] == rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORFPX64 I 
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Set Flag if Equal Floating-Point Single- 


If.sfeq.s nage If.sfeq.s 
Precision 
31 elie fe {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. yee /f telas f/f). fief} afte . Bi7 |i. \f. |e lf. fe | 0 
opcode 0x32 reserved A B reserved opcode 0x8 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.sfeq.s rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the two registers are equal, the compare flag is set; otherwise the 
compare flag is cleared. 

32-bit Implementation: 

SR[F] <- rA[31:0] == rB[31:0] 

64-bit Implementation: 

SR[F] < rA[31:0] == rB[31:0] 

Exceptions: 


None 


Instruction Class 
ORFPX32 I 
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Set Flag if Greater or Equal Than 
If.stge.d Floating-Point Double-Precision It.stge.d 


31 wf. Hye |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. ]| 1645 |). |). |). |) 1/10 . Bi7 |i. \f. |e |e fe | 0 
opcode 0x32 reserved A B reserved opcode 0x1b 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sfge.d rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is greater than or equal to the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

SR[F] < rA[63:0] >= rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORFPX64 I 
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Set Flag if Greater or Equal Than 


If.sfge.s : ee <i If.sfge.s 
g Floating-Point Single-Precision 8 
31 elie fe {fe |] 2625 |f. |]. Jf. |] 2aizo |]. ee /f telas f/f). fle} ta fto . Biz |i. \f. |e |e fe | 0 
opcode 0x32 reserved A B reserved opcode Oxb 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.sfge.s rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is greater than or equal to the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] >= rB[31:0] 

64-bit Implementation: 

SR[F] < rA[31:0] >= rB[31:0] 

Exceptions: 


None 


Instruction Class 
ORFPX32 I 
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Set Flag if Greater Than Floating-Point 


If.sfgt.d ae If.sfgt.d 
g Double-Precision 8 
31 elie ff. {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. Pee} telas f/f). fl. f} afte . Bi7 |i. \f. lf. |e fe | 0 
opcode 0x32 reserved A B reserved opcode Ox1a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sfgt.d rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is greater than the second register, the compare flag 
is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

SR[F] <- rA[63:0] > rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORFPX64 I 
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Set Flag if Greater Than Floating-Point 


If.sfgt.s : - If.sfgt.s 
g Single-Precision 8 
31 fie fe {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. eye} telas f/f). fle} afte . Bi7 |i. |}. lf. | fe fe | 0 
opcode 0x32 reserved A B reserved opcode Oxa 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.sfgt.s rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is greater than the second register, the compare flag 
is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] < rA[31:0] > rB[31:0] 

64-bit Implementation: 

SR[F] < rA[31:0] > rB[31:0] 

Exceptions: 


None 


Instruction Class 
ORFPX32 I 
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Set Flag if Less or Equal Than Floating- 
oe Point Double-Precision nore 


31 wf. Hye |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || 16445 |). |). |). |) 1/10 . Bi7 |i. \f. lf. |e fe | 0 
opcode 0x32 reserved A B reserved opcode Oxid 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.sfle.d rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is less than or equal to the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

SR[F] < rA[63:0] <= rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORFPX64 I 
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If.sfle.s Set Flag if Less or Equal Than Floating- If.sfle.s 
Point Single-Precision 


31 wf. Hye |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || 1645 |). |). |). |) 1}10 . Bi7 |i. \f. |. |e fe | 0 
opcode 0x32 reserved A B reserved opcode Oxd 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sfle.s rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is less than or equal to the second register, the 
compare flag is set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[LF] < rA[31:0] <= rB[31:0] 

64-bit Implementation: 

SR[F] < rA[31:0] <= rB[31:0] 

Exceptions: 


None 


Instruction Class 
ORFPX32 I 
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Set Flag if Less Than Floating-Point 


If.sflt.d a If.sflt.d 
Double-Precision 
31 wf. Hye |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || 165 |). |). |). |) 1/10 . Biz |i. \f. |. fe fe | 0 
opcode 0x32 reserved A B reserved opcode Oxic 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sflt.d rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is less than the second register, the compare flag is 
set; otherwise the compare flag is cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

SR[F] <- rA[63:0] < rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORFPX64 I 
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If. sflt.s Set Flag if Less Than Floating-Point 


e e e If.sflt.s 
Single-Precision 
31 fie fe {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. eye} telas f/f). fle} afte . Bi7 |i. \f. |e fe fe | 0 
opcode 0x32 reserved A B reserved opcode Oxc 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1f.sflt.s rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the first register is less than the second register, the compare flag is 
set; otherwise the compare flag is cleared. 

32-bit Implementation: 

SR[F] <- rA[31:0] < rB[31:0] 

64-bit Implementation: 

SR[F] < rA[31:0] < rB[31:0] 

Exceptions: 


None 


Instruction Class 
ORFPX32 I 
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Set Flag if Not Equal Floating-Point 


If.sfne.d ae If.sfne.d 
Double-Precision 
31 elie ff. {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. Pee} telas f/f). fl. f} afte . Biz |i. \f. |e |e fe |e 0 
opcode 0x32 reserved A B reserved opcode 0x19 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lf.sfne.d rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the two registers are not equal, the compare flag is set; otherwise the 
compare flag is cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

SR[F] <- rA[63:0] != rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORFPX64 I 
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Set Flag if Not Equal Floating-Point 


If.sfne.s ‘ — If.sfne.s 
Single-Precision 
31 fie fe {fe |] 2625 |f. |]. Jf. |] 2aiz0 |]. eye} telas f/f). fle} afte . Bi7 |i. \f. |e |e fe | 0 
opcode 0x32 reserved A B reserved opcode 0x9 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sfne.s rA,rB 

Description: 

The contents of general-purpose register rA and the contents of general-purpose register 
rB are compared. If the two registers are not equal, the compare flag is set; otherwise the 
compare flag is cleared. 

32-bit Implementation: 

SR[F] <- rA[31:0] != rB[31:0] 

64-bit Implementation: 

SR[F] < rA[31:0] != rB[31:0] 

Exceptions: 


None 


Instruction Class 
ORFPX32 I 
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Subtract Floating-Point Double- 


If.sub.d isa! If.sub.d 
Precision 
31 wf. Hye |e 2625 ||. ||. ||. || 21/20 ||. ||. ||. || 1645 |). |). |). |) 1/10 . Biz |i. \f. |e lf. |fe | 0 
opcode 0x32 D A B reserved opcode 0x11 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1lf.sub.d rD,rA, rB 

Description: 

The contents of general-purpose register rB are subtracted from the contents of general- 
es register rA to form the result. The result is placed into general-purpose register 
32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] « rA[63:0] - rB[63:0] 

Exceptions: 


Floating Point 


Instruction Class 
ORFPX64 I 
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If.sub.s Subtract Floating-Point Single-Precision If.sub.s 


31 wf lye |e 2625 ||. ||. ||. || 21/20 |/. ||. ||. || teas |). |). |}. |] 1/10 . Bi7 |i. \f. lf. fe fe | 0 
opcode 0x32 D A B reserved opcode 0x1 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


1f.sub.s rD,rA,rB 

Description: 

The contents of general-purpose register rB are subtracted from the contents of general- 
a register rA to form the result. The result is placed into general-purpose register 
32-bit Implementation: 

rD[31:0] « rA[31:0] - rB[31:0] 

64-bit Implementation: 


rD[31:0] < rA[31:0] - rB[31:0] 
rD[63:32] < 0 


Exceptions: 


Floating Point 


Instruction Class 
ORFPX32 I 
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Iv.add.b Vector Byte Elements Add Signed _ lv.add.b 


31 ae Melee Mis 2625 scalises 8 2120 Pal eaie 1615 Hs lls 1110 : 87 


opcode Oxa D A B reserved opcode 0x30 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


5.5 ORVDX64 


Format: 

lv.add.b rD,rA, rB 

Description: 

The byte elements of general-purpose register rA are added to the byte elements of 
general-purpose register rB to form the result elements. The result elements are placed 
into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[ 7:0] - rA[7:0] + rB[7:0] 
rD[15:8] < rA[15:8] + rB[15:8] 
rD[23:16] -— rA[23:16] + rB[23:16] 
rD[31:24] <— rA[31:24] + rB[31:24] 
rD[39:32] -— rA[39:32] + rB[39:32] 
rD[47:40] - rA[47:40] + rB[ 47:40] 
rD[55:48] <— rA[55:48] + rB[55:48] 
rD[63:56] - rA[63:56] + rB[63:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Add 


Iv.add.h ; Iv.add.h 
Signed 
B31}. |. |]. |]. |] 2625 |. Jf. |}. |) 2tjzo |]. |). fe |} teas |). ff. Jf. J] tate ; a7 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x31 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.add.h rD,rA,rB 

Description: 

The half-word elements of general-purpose register rA are added to the half-word 
elements of general-purpose register rB to form the result elements. The result elements 
are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] «< rA[15:0] + rB[15:0] 
rD[31:16] -— rA[31:16] + rB[31:16] 
rD[47:32] <— rA[47:32] + rB[47:32] 
rD[63:48] <— rA[63:48] + rB[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Add Signed 


lv.adds.b lv.adds.b 
Saturated 
31) |. fi. if. He} 2625 FT dey} 2tiao LTP }} teas Jf.) Fe TF ato : ai7 |}. |i. |i. ||. tel. 0 
opcode Oxa D A B reserved opcode 0x32 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.adds.b rD,rA, rB 

Description: 

The byte elements of general-purpose register rA are added to the byte elements of 
general-purpose register rB to form the result elements. If the result exceeds the min/max 
value for the destination data type, it is saturated to the min/max value and placed into 
general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] - sat8s(rA[7:0] + rB[7:0]) 
rD[15:8] < sat8s(rA[15:8] + rB[15:8]) 
rD[ 23:16] -— sat8s(rA[23:16] + rB[23:16]) 
rD[31:24] -— sat8s(rA[31:24] + rB[31:24]) 
rD[39:32] <— sat8s(rA[39:32] + rB[39:32]) 
rD[ 47:40] - sat8s(rA[47:40] + rB[47:40]) 
rD[ 55:48] - sat8s(rA[55:48] + rB[55:48]) 
rD[63:56] <— sat8s(rA[63:56] + rB[63:56]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Add 


Iv.adds.h : Iv.adds.h 
Signed Saturated 
B31. |]. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). i. |} teas |. ff. Jf. |] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x33 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.adds.h rD,rA, rB 

Description: 

The half-word elements of general-purpose register rA are added to the half-word 
elements of general-purpose register rB to form the result elements. If the result exceeds 
the min/max value for the destination data type, it is saturated to the min/max value and 
placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < sati6s(rA[15:0] + rB[15:0]) 
rD[31:16] <— satié6s(rA[31:16] + rB[31:16]) 
rD[ 47:32] -— sati6s(rA[47:32] + rB[47:32]) 
rD[63:48] -— sati6s(rA[63:48] + rB[63:48] ) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.addu.b Vector Byte Elements Add Unsigned Iv.addu.b 


31 ||. |). Jf |]. |] 26i@5 fe Jf. f}e ) 2tjao yf. fe fe |} teas). ff. J] tate a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x34 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.addu.b rD,rA, rB 

Description: 

The unsigned byte elements of general-purpose register rA are added to the unsigned byte 
elements of general-purpose register rB to form the result elements. The result elements 
are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[ 7:0] - rA[7:0] + rB[7:0] 
rD[15:8] < rA[15:8] + rB[15:8] 
rD[23:16] <— rA[23:16] + rB[23:16] 
rD[31:24] <-— rA[31:24] + rB[31:24] 
rD[39:32] -— rA[39:32] + rB[39:32] 
rD[47:40] <— rA[47:40] + rB[47:40] 
rD[ 55:48] - rA[55:48] + rB[55:48] 
rD[63:56] -— rA[63:56] + rB[63:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Add 


Iv.addu.h ‘ Iv.addu.h 
Unsigned 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajeo |]. |). i. |} teas |. ff. Jf. J] tate ; a7 |i. |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x35 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.addu.h rD,rA, rB 

Description: 

The unsigned half-word elements of general-purpose register rA are added to the 
unsigned half-word elements of general-purpose register rB to form the result elements. 
The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] «< rA[15:0] + rB[15:0] 
rD[31:16] -— rA[31:16] + rB[31:16] 
rD[47:32] <— rA[47:32] + rB[47:32] 
rD[63:48] <— rA[63:48] + rB[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Add 


Iv.addus.b : Iv.addus.b 
Unsigned Saturated 
B31. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjzo |]. |). |. |} teas |). ff. fe J] tate ; a7 |i. ||. He le [fe | 0 
opcode Oxa D A B reserved opcode 0x36 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.addus.b rD,rA,rB 

Description: 

The unsigned byte elements of general-purpose register rA are added to the unsigned byte 
elements of general-purpose register rB to form the result elements. If the result exceeds 
the min/max value for the destination data type, it is saturated to the min/max value and 
placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] - sat8u(rA[7:0] + rB[7:0]) 
rD[15:8] < sat8u(rA[15:8] + rB[15:8]) 
rD[23:16] -— sat8u(rA[23:16] + rB[23:16]) 
rD[31:24] -— sat8u(rA[31:24] + rB[31:24]) 
rD[39:32] <— sat8u(rA[39:32] + rB[39:32]) 
rD[ 47:40] - sat8u(rA[47:40] + rB[47:40]) 
rD[55:48] -— sat8u(rA[55:48] + rB[55:48]) 
rD[63:56] <— sat8u(rA[63:56] + rB[63:56]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Add 


Iv.addus.h : Iv.addus.h 
Unsigned Saturated 
B31. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjzo |]. |). |. |} teas |). ff. fe J] tate ; a7 |i. ||. He le [fe | 0 
opcode Oxa D A B reserved opcode 0x37 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.addus.h rD,rA,rB 

Description: 

The unsigned half-word elements of general-purpose register rA are added to the 
unsigned half-word elements of general-purpose register rB to form the result elements. 
If the result exceeds the min/max value for the destination data type, it is saturated to the 
min/max value and placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < sati6s(rA[15:0] + rB[15:0]) 
rD[31:16] <— satié6s(rA[31:16] + rB[31:16]) 
rD[ 47:32] -— sati6s(rA[47:32] + rB[47:32]) 
rD[63:48] -— sati6s(rA[63:48] + rB[63:48] ) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.all_eq.b Vector Byte Elements All Equal lv.all_eq.b 


31. |/. Jf. |]. |] 26i@5 fe. tJ.) 2tjao if. fe fe |} teas). ff. fe |] tate B77 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x10 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.all_eq.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if all corresponding elements are 
equal; otherwise the compare flag is cleared. The compare flag is replicated into all bit 
positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag <- rA[7:0] == rB[7:0] && 
rA[15:8] == rB[15:8] && 
rA[23:16] == rB[23:16] && 
rA[31:24] == rB[31:24] && 
rA[39:32] == rB[39:32] && 


rA[47:40] == rB[47:40] && 
rA[55:48] == rB[55:48] && 
rA[63:56] == rB[63:56] 
rD[63:0] < repl(flag) 
Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements All 


Iv.all_eq.h Equal 


Iv.all_eq.h 


31 ae hse ee Hil 2625 scilliew’| fs 2120 Par eaes 1615 = Wells 1110 


opcode Oxa D A B reserved opcode 0x11 


6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_eq.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if all corresponding 
elements are equal; otherwise the compare flag is cleared. The compare flag is replicated 
into all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[15:0] == rB[15:0] && 
rA[31:16] == rB[31:16] && 
rA[47:32] == rB[47:32] && 
rA[63:48] == rB[63:48] 
rD[63:0] < repl(flag) 
Exceptions: 
None 
Instruction Class 
ORVDX64 I 
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Vector Byte Elements All Greater 


Iv.all_ge.b lv.all_ge.b 


Than or Equal To 
B31. |. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). fe |} teas |). fi. Jf. J] tate ; a7 |i. ||. |e |e [fe | 0 
opcode Oxa D A B reserved opcode 0x12 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_ge.b rD,rA,rB 
Description: 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if all elements of rA are greater than 
or equal to the elements of rB; otherwise the compare flag is cleared. The compare flag is 
replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag «- rA[7:0] >= rB[7:0] && 
rA[15:8] >= rB[15:8] && 
rA[23:16] >= rB[23:16] && 
rA[31:24] >= rB[31:24] && 
rA[39:32] >= rB[39:32] && 
rA[47:40] >= rB[47:40] && 
rA[55:48] >= rB[55:48] && 
rA[63:56] >= rB[63:56] 

rD[63:0] <- repl( flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Iv.all_ge.h Vector Half-Word Elements All 


Iv.all_ge.h 
Greater Than or Equal To -§ 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajzo |]. |). fe |} teas |). ff. fe J] tate ; a7 |f. ||. |e le |e | 0 
opcode Oxa D A B reserved opcode 0x13 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_ge.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if all elements of rA are 
greater than or equal to the elements of rB; otherwise the compare flag is cleared. The 
compare flag is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 


64-bit Implementation: 


flag - rA[15:0] >= rB[15:0] && 
rA[31:16] >= rB[31:16] && 
rA[47:32] >= rB[47:32] && 
rA[63:48] >= rB[63:48] 
rD[63:0] <« repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements All Greater 


lv.all_gt.b ahaa 


lv.all_gt.b 


31 ae hse ee Hil 2625 scilliew’| fs 2120 Par eaes 1615 = Wells 1110 i 87 


opcode Oxa D A B reserved opcode 0x14 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.all_gt.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if all elements of rA are greater than 
the elements of rB; otherwise the compare flag is cleared. The compare flag is replicated 
into all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[7:0] > rB[7:0] && 
rA[15:8] > rB[15:8] && 
rA[23:16] > rB[23:16] && 
rA[31:24] > rB[31:24] && 
rA[39:32] > rB[39:32] && 
rA[47:40] > rB[47:40] && 
rA[55:48] > rB[55:48] && 
rA[63:56] > rB[63:56] 

rD[63:0] <- repl(flag) 

Exceptions: 

None 


Instruction Class 
ORVDX64 I 


www.opencores.org 1.2-1 175 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Vector Half-Word Elements All 


Iv.all_gt.h Iv.all_gt.h 


Greater Than 
B31}. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjao |]. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x15 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_gt.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if all elements of rA are 
greater than the elements of rB; otherwise the compare flag is cleared. The compare flag 
is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 


64-bit Implementation: 


flag «- rA[15:0] > rB[15:0] && 
rA[31:16] > rB[31:16] && 
rA[47:32] > rB[47:32] && 
rA[63:48] > rB[63:48] 
rD[63:0] « repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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lv.all_le.b Vector Byte Elements All Less Than Iv.all_le.b 


or Equal To 
B31}. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjeo |]. |). tf. |} teas |. ff. Jf. |] tate ; a7 |i. fe fe le |e | 0 
opcode Oxa D A B reserved opcode 0x16 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_le.b rD,rA,rB 
Description: 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if all elements of rA are less than or 
equal to the elements of rB; otherwise the compare flag is cleared. The compare flag is 
replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag «- rA[7:0] <= rB[7:0] && 
rA[15:8] <= rB[15:8] && 
rA[23:16] <= rB[23:16] && 
rA[31:24] <= rB[31:24] && 
rA[39:32] <= rB[39:32] && 
rA[47:40] <= rB[47:40] && 
rA[55:48] <= rB[55:48] && 
rA[63:56] <= rB[63:56] 

rD[63:0] <- repl( flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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lv.all_ eh Vector Half-Word Elements All Less Iv.all leh 


Than or Equal To 
B31. |. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). fe |} teas |). fi. Jf. J] tate ; g7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x17 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_le.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if all elements of rA are 
less than or equal to the elements of rB; otherwise the compare flag is cleared. 
The compare flag is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 


64-bit Implementation: 


flag - rA[15:0] <= rB[15:0] && 
rA[31:16] <= rB[31:16] && 
rA[47:32] <= rB[47:32] && 
rA[63:48] <= rB[63:48] 
rD[63:0] « repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Iv.all_It.b Vector Byte Elements All Less Than lv.all_It.b 


31 ||. |). Jf. |]. |] 26i@5 ff. Jf. Je) 2tjzo yf. fe fe |} teas |). ff. fe |] tate B77 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x18 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.all_lt.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if all elements of rA are less than the 
elements of rB; otherwise the compare flag is cleared. The compare flag is replicated into 
all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag - rA[7:0] < rB[7:0] && 
rA[15:8] < rB[15:8] && 
rA[23:16] < rB[23:16] && 
rA[31:24] < rB[31:24] && 
rA[39:32] < rB[39:32] && 
rA[47:40] < rB[47:40] && 
rA[55:48] < rB[55:48] && 
rA[63:56] < rB[63:56] 

rD[63:0] < repl(flag) 

Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.all_It.h Vector Half-Word Elements All Less 


Iv.all_It.h 
Than 
B31. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjeo |]. |). fe |} teas |. ff. Jf. J] tate ; B77 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x19 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_lt.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if all elements of rA are 
less than the elements of rB; otherwise the compare flag is cleared. The compare flag is 
replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag «- rA[15:0] < rB[15:0] && 
rA[31:16] < rB[31:16] && 
rA[47:32] < rB[47:32] && 
rA[63:48] < rB[63:48] 
rD[63:0] « repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements All Not 


lv.all_ne.b Equal 


lv.all_ne.b 


31 ae hse ee Hil 2625 scilliew’| fs 2120 Par eaes 1615 = Wells 1110 i 87 


opcode Oxa D A B reserved opcode Ox1a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.all_ne.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if all corresponding elements are not 
equal; otherwise the compare flag is cleared. The compare flag is replicated into all bit 
positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[7:0] != rB[7:0] && 
rA[15:8] != rB[15:8] && 
rA[23:16] != rB[23:16] && 
rA[31:24] != rB[31:24] && 
rA[39:32] != rB[39:32] && 
rA[47:40] != rB[47:40] && 
rA[55:48] != rB[55:48] && 
= 


rA[63:56] != rB[63:56] 
rD[63:0] <- repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements All 


Iv.all_ne.h Iv.all_ne.h 
- Not Equal — 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2tjzo |]. |). fe |} teas |). fi. Jf. J] tate ; a7 |i. ||. He le [fe | 0 
opcode Oxa D A B reserved opcode 0x1b 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.all_ne.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if all corresponding 
elements are not equal; otherwise the compare flag is _ cleared. 
The compare flag is replicated into all bit positions of general-purpose register rD. 
32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[15:0] != rB[15:0] && 
rA[31:16] != rB[31:16] && 
rA[47:32] != rB[47:32] && 
rA[63:48] != rB[63:48] 

rD[63:0] < repl(flag) 

Exceptions: 

None 


Instruction Class 
ORVDX64 I 





www.opencores.org 1.2-1 182 of 357 


OpenCores 


lv.and 
31 ||. |}. Jf. |i. || 26/25 
opcode Oxa D 
6 bits 5 bits 
Format: 


lv.and rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual September 7, 2017 
Vector And Iv.and 
21/20 =|]. |. ||. 1615 ||. |]. ||. 1110 ; 87 lf. |). |. {fe fle |. 0 
A B reserved opcode 0x38 
5 bits 5 bits 3 bits 8 bits 


The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical AND operation. The result is placed into general- 


purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[63:0] <— rA[63:0] AND rB[63:0] 


Exceptions: 


None 
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ORVDX64 I 
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Iv.any_eq.b Vector Byte Elements Any Equallv.any_eq.b 


31 ||. |]. Jf. |]. |} 26i@5 ff. Jf. f}e |) 2tjeo yf. fe fe |} teas). ff. fe |] tate B77 |i. ||. |e lye {fe | 0 
opcode Oxa D A B reserved opcode 0x20 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.any_eq.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if any two corresponding elements 
are equal; otherwise the compare flag is cleared. The compare flag is replicated into all 
bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag - rA[7:0] == rB[7:0] || 
rA[15:8] == rB[15:8] || 
rA[23:16] == rB[23:16] || 
rA[31:24] == rB[31:24] || 
rA[39:32] == rB[39:32] || 
rA[47:40] == rB[47:40] || 
rA[55:48] == rB[55:48] | | 


rA[63:56] == rB[63:56] 
rD[63:0] < repl( flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Any 
Iv.any_eq.h Iv. 


any_eq.h 
Equal 
B31}. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjzo |]. |). fe |} teas |. ff. |. J] tate a7 |i. |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x21 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_eq.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if any two corresponding 
elements are equal; otherwise the compare flag is cleared. The compare flag is replicated 
into all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag - rA[15:0] == rB[15:0] || 
rA[31:16] == rB[31:16] || 
rA[47:32] == rB[47:32] || 
rA[63:48] == rB[63:48] 
rD[63:0] < repl(flag) 
Exceptions: 
None 
Instruction Class 
ORVDX64 I 
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Vector Byte Elements Any 


Iv.any_ge.b Iv.any_ge.b 


Greater Than or Equal To 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajzo |]. |). fe |} teas |). ff. fe J] tate ; a7 |f. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x22 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_ge.b rD,rA,rB 
Description: 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if any element of rA is greater than 
or equal to the corresponding element of rB; otherwise the compare flag is cleared. The 
compare flag is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag «- rA[7:0] >= rB[7:0] || 
rA[15:8] >= rB[15:8] | 
rA[23:16] >= rB[23:16] 
rA[31:24] >= rB[31:24] 
rA[39:32] >= rB[39:32] 
rA[47:40] >= rB[47:40] 
rA[55:48] >= rB[55:48] 
rA[63:56] >= rB[63:56] 

rD[63:0] <- repl( flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Any lv 


Iv.any_ge.h any_ge.h 
y¥_S Greater Than or Equal To y¥_S 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajzo |]. |). fe |} teas |). ff. fe J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x23 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_ge.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if any element of rA is 
greater than or equal to the corresponding element of rB; otherwise the compare flag is 


cleared. The compare flag is replicated into all bit positions of general-purpose register 
rD. 


32-bit Implementation: 

N/A 

64-bit Implementation: 

flag - rA[15:0] >= rB[15:0] || 
rA[31:16] >= rB[31:16] || 
rA[47:32] >= rB[47:32] || 
rA[63:48] >= rB[63:48] 

rD[63:0] <- repl( flag) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Any 


Iv.any_gt.b Iv.any_gt.b 


Greater Than 
B31}. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjao |]. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. ||. fe le [fe | 0 
opcode Oxa D A B reserved opcode 0x24 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.any_gt.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if any element of rA is greater than 
the corresponding element of rB; otherwise the compare flag is cleared. The compare flag 
is replicated into all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[7:0] > rB[7:0] || 
rA[15:8] > rB[15:8] || 
rA[23:16] > rB[23:16] || 
rA[31:24] > rB[31:24] || 
rA[39:32] > rB[39:32] || 
rA[47:40] > rB[47:40] || 
rA[55:48] > rB[55:48] || 
rA[63:56] > rB[63:56] 

rD[63:0] <- repl(flag) 

Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Any 
Iv.any_gt.h Iv.any_gt.h 
y_& Greater Than y¥_§ 
B31}. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjao |]. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x25 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_gt.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if any element of rA is 
greater than the corresponding element of rB; otherwise the compare flag is cleared. The 
compare flag is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag - rA[15:0] > rB[15:0] || 
rA[31:16] > rB[31:16] || 
rA[47:32] > rB[l47:32] || 
rA[63:48] > rB[63:48] 
rD[63:0] « repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Any Less 
Iv.any_le.b Iv.any_le.b 
Than or Equal To 
31 se (lise Hdl se) Ke 2625 Aillliey | hs 2120 ae |e ts 1615 Pal eal 1110 7 BI7 J). Te Te Ue fe de 0 
opcode Oxa D A B reserved opcode 0x26 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_le.b rD,rA,rB 
Description: 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if any element of rA is less than or 
equal to the corresponding element of rB; otherwise the compare flag is cleared. The 
compare flag is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag «- rA[7:0] <= rB[7:0] || 
rA[15:8] <= rB[15:8] | 
rA[23:16] <= rB[23:16] 
rA[31:24] <= rB[31:24] 
rA[39:32] <= rB[39:32] 
rA[47:40] <= rB[47:40] 
rA[55:48] <= rB[55:48] 
rA[63:56] <= rB[63:56] 

rD[63:0] <- repl( flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Any lv 


Iv.any_le.h any_le.h 
y_ Less Than or Equal To y— 
B31}. |. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). |. |} teas |. ff. fe |] tate ; a7 fe |. |e le |e | 0 
opcode Oxa D A B reserved opcode 0x27 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_le.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if any element of rA is 
less than or equal to the corresponding element of rB; otherwise the compare flag is 


cleared. The compare flag is replicated into all bit positions of general-purpose register 
rD. 


32-bit Implementation: 

N/A 

64-bit Implementation: 

flag - rA[15:0] <= rB[15:0] || 
rA[31:16] <= rB[31:16] || 
rA[47:32] <= rB[47:32] || 
rA[63:48] <= rB[63:48] 

rD[63:0] <- repl( flag) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Any Less 


Iv.any_It.b Iv.any_It.b 
y- Than y- 
31 elle dpe df} 26250 Jd). Tq} 2u20 Jf. |). ye} elas |e). J}. J) ante : ai7 |}. |i. |i. jf. ly. Hy. 0 
opcode Oxa D A B reserved opcode 0x28 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.any_lt.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if any element of rA is less than the 
corresponding element of rB; otherwise the compare flag is cleared. The compare flag is 
replicated into all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[7:0] < rB[7:0] || 
rA[15:8] < rB[15:8] || 
rA[23:16] < rB[23:16] || 
rA[31:24] < rB[31:24] || 
rA[39:32] < rB[39:32] || 
rA[47:40] < rB[47:40] || 
rA[55:48] < rB[55:48] || 
rA[63:56] < rB[63:56] 

rD[63:0] <- repl(flag) 

Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements An 
Iv.any_It.h y Iv.any_It.h 
Less Than 
B31}. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjzo |]. |). fe |} teas |. ff. Jf. |] tate ; a7 fe |. He le [fe | 0 
opcode Oxa D A B reserved opcode 0x29 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_lt.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if any element of rA is 
less than the corresponding element of rB; otherwise the compare flag is cleared. The 
compare flag is replicated into all bit positions of general-purpose register rD. 


32-bit Implementation: 
N/A 
64-bit Implementation: 


flag - rA[15:0] < rB[15:0] || 
rA[31:16] < rB[31:16] || 
rA[47:32] < rB[l47:32] || 
rA[63:48] < rB[63:48] 
rD[63:0] « repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Any Not 


Iv.any_ne.b Equal 


Iv.any_ne.b 


31 ae hse ee Hil 2625 scilliew’| fs 2120 Par eaes 1615 = Wells 1110 i 87 


opcode Oxa D A B reserved opcode 0x2a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.any_ne.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. The compare flag is set if any two corresponding elements 
are not equal; otherwise the compare flag is cleared. The compare flag is replicated into 
all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag «- rA[7:0] != rB[7:0] || 
rA[15:8] != rB[15:8] || 
rA[23:16] != rB[23:16] | | 
rA[31:24] != rB[31:24] || 
rA[39:32] != rB[39:32] || 
rA[47:40] != rB[47:40] || 
rA[55:48] != rB[55:48] || 
= 


rA[63:56] != rB[63:56] 
rD[63:0] <- repl(flag) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Any 
Iv.any_ne.h Iv.any_ne.h 
Not Equal 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2tjzo |]. |). fe |} teas |). fi. Jf. J] tate ; a7 |i. |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x2b 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.any_ne.h rD,rA,rB 


Description: 


All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. The compare flag is set if any two corresponding 
elements are not equal; otherwise the compare flag is cleared. The compare flag is 
replicated into all bit positions of general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


flag - rA[15:0] != rB[15:0] || 
rA[31:16] != rB[31:16] || 
rA[47:32] != rB[47:32] || 
rA[63:48] != rB[63:48] 

rD[63:0] < repl(flag) 

Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.avg.b 


31 


opcode Oxa 
6 bits 


Format: 


2625 


OpenRISC 1000 Architecture Manual 


September 7, 2017 


Vector Byte Elements Average Iv.avg.b 

{fe ff. |] 21/20 1615 3... 1110 ; 87 Hedy. fe {fe le |. 0) 
D A B reserved opcode 0x39 

5 bits 5 bits 5 bits 3 bits 8 bits 


lv.avg.b rD,rA,rB 


Description: 


The byte elements of general-purpose register rA are added to the byte elements of 
general-purpose register rB, and the sum is shifted right by one to form the result 
elements. The result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[ 7:0] 


rD[15: 
rD[23: 
rD[ 31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


www.opencores.org 


8 | 
16] 
24] 
32] 
4Q] 
48] 
56] 


oP a a a ah at 





(rA[7:0] 


(rA[15: 
(rA[23: 
(rA[321: 
(rA[39: 
(rA[47: 
(rA[55: 
(rA[63: 


8] 

16] 
24] 
32] 
40] 
48] 
56] 


rB[7:0]) >> 1 
rB[15:8]) >> 1 
rB[23:16]) >> 1 
rB[31:24]) >> 1 
rB[39:32]) >> 1 
rB[47:40]) >> 1 
rB[55:48]) >> 1 
rB[63:56]) >> 1 
Instruction Class 
ORVDX64 I 
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lv.avg.h Vector Half-Word Elements Average lv.avg.h 


31 ||. |). Jf. |]. |] 26i@5 ff. Jf. f]e |) 2tjao yf. tye fe |} teas |). ff. fe J] tate a7 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x3a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.avg.h rD,rA,rB 

Description: 

The half-word elements of general-purpose register rA are added to the half-word 
elements of general-purpose register rB, and the sum is shifted right by one to form the 
result elements. The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < (rA[15:0] + rB[15:0]) >> 41 
rD[31:16] < (rA[31:16] + rB[31:16]) >> 1 
rD[47:32] < (rA[47:32] + rB[47:32]) >> 1 
rD[63:48] < (rA[63:48] + rB[63:48]) >> 1 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.cmp_eq.b 


31 2625 2120 


opcode Oxa 
6 bits 5 bits 
Format: 


lv.cmp_eq.b rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual 


Vector Byte Elements Compare 


September 7, 2017 


Equal Iv.cmp_eq.b 


1615 1110 


reserved 
3 bits 


opcode 0x40 


5 bits 5 bits 8 bits 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. Bits of the element in general-purpose register rD are set if 
the two corresponding compared elements are equal; otherwise the element bits are 


cleared. 
32-bit Implementation: 
N/A 


64-bit Implementation: 


rD[ 7:0] - repl(rA[7:0] == rB[7:0]) 
rD[15:8] < repl(rA[15:8] == rB[15:8]) 
rD[23:16] -— repl(rA[23:16] == rB[23:16]) 
rD[31:24] «- repl(rA[31:24] == rB[31:24]) 
rD[39:32] <— repl(rA[39:32] == rB[39:32]) 
rD[47:40] < repl(rA[47:40] == rB[47:40]) 
rD[55:48] <- repl(rA[55:48] == rB[55:48]) 
rD[63:56] -— repl(rA[63:56] == rB[63:56]) 
Exceptions: 

None 
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Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


Iv.cmp_eq.h Iv.cmp_eq.h 


Compare Equal 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2tjao |]. |). fe |} teas |. |i. Jf. |] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x41 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.cmp_eq.h rD,rA,rB 

Description: 

All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. Bits of the element in general-purpose register 
rD are set if the two corresponding compared elements are equal; otherwise the element 
bits are cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] <« repl(rA[15:0] == rB[15:0]) 

rD[31:16] -— repl(rA[31:16] == rB[31:16]) 

rD[47:32] <- repl(rA[47:32] == rB[47:32]) 

rD[63:48] <- repl(rA[63:48] == rB[63:48]) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Compare 


lv.cmp_ge-b “Greater Than or Equal To 


lv.cmp_ge.b 


B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajzo |]. |). fe |} teas |). ff. fe J] tate ; a7 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x42 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.cmp_ge.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. Bits of the element in general-purpose register rD are set if 
the element in rA is greater than or equal to the element in rB; otherwise the element bits 
are cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 


rD[ 7:0] - repl(rA[7:0] >= rB[7:0]) 


rD[15:8] < repl(rA[15:8] >= rB[15:8]) 
rD[23:16] -— repl(rA[23:16] >= rB[23:16]) 
rD[31:24] -— repl(rA[31:24] >= rB[31:24]) 
rD[39:32] -— repl(rA[39:32] >= rB[39:32]) 
rD[47:40] <- repl(rA[47:40] >= rB[47:40]) 
rD[55:48] - repl(rA[55:48] >= rB[55:48]) 
rD[63:56] - repl(rA[63:56] >= rB[63:56]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 
Iv.cmp_ge.h Compare Greater Than or — lv.cmp_ge.h 


Equal To 
B31]. |. |]. |]. |] 2625 |. Jf. |]. |) 2ajzo |]. |). |i. |} teas |. |i. Jf. |] afte a7 |i. ||. |]. le {le | 0 
opcode Oxa D A B reserved opcode 0x43 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.cmp_ge.h rD,rA,rB 

Description: 

All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. Bits of the element in general-purpose register 
rD are set if the element in rA is greater than or equal to the element in rB; otherwise the 
element bits are cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] < repl(rA[15:0] >= rB[15:0]) 

rD[31:16] - repl(rA[31:16] >= rB[31:16]) 

rD[47:32] <— repl(rA[47:32] >= rB[47:32]) 

rD[63:48] -— repl(rA[63:48] >= rB[63:48] ) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Compare 


lv.cmp_gt.b Iv.cmp_gt.b 


Greater Than 
B31}. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjao |]. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x44 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.cmp_gt.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. Bits of the element in general-purpose register rD are set if 
the element in rA is greater than the element in rB; otherwise the element bits are cleared. 
32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[ 7:0] - repl(rA[7:0] > rB[L7:0]) 
rD[15:8] < repl(rA[15:8] > rB[15:8]) 
rD[23:16] -— repl(rA[23:16] > rB[23:16] ) 
rD[31:24] -— repl(rA[31:24] > rB[31:24]) 
rD[39:32] -— repl(rA[39:32] > rB[39:32]) 
rD[47:40] <- repl(rA[47:40] > rB[47:40]) 
rD[55:48] - repl(rA[55:48] > rB[55:48] ) 
rD[63:56] - repl(rA[63:56] > rB[63:56] ) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


Iv.cmp_gt.h Compare Greater Than 


Iv.cmp_gt.h 


B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2tjao |]. |). fe |} teas |. ff. fe J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x45 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.cmp_gt.h rD,rA,rB 

Description: 

All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. Bits of the element in general-purpose register 
rD are set if the element in rA is greater than the element in rB; otherwise the element bits 
are cleared. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < repl(rA[15:0] > rB[15:0]) 
rD[31:16] -— repl(rA[31:16] > rB[31:16] ) 
rD[ 47:32] -— repl(rA[47:32] > rB[47:32]) 
rD[63:48] -— repl(rA[63:48] > rB[63:48] ) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Compare 


Iv.cmp_le.b Iv.cmp_le.b 


Less Than or Equal To 
B31}. |. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). |. |} teas |. ff. fe |] tate ; B77 |i. ||. |e |e [fe | 0 
opcode Oxa D A B reserved opcode 0x46 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.cmp_le.b rD,rA,rB 

Description: 

All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. Bits of the element in general-purpose register rD are set if 
the element in rA is less than or equal to the element in rB; otherwise the element bits are 
cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 


rD[ 7:0] - repl(rA[7:0] <= rB[7:0]) 


rD[15:8] < repl(rA[15:8] <= rB[15:8]) 
rD[ 23:16] -— repl(rA[23:16] <= rB[23:16]) 
rD[31:24] -— repl(rA[31:24] <= rB[31:24]) 
rD[39:32] <— repl(rA[39:32] <= rB[39:32]) 
rD[ 47:40] <- repl(rA[47:40] <= rB[47:40]) 
rD[55:48] - repl(rA[55:48] <= rB[55:48]) 
rD[63:56] - repl(rA[63:56] <= rB[63:56]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


lv.cmp_le.h Compare Less Than or Equal To 


Iv.cmp_le.h 


B31}. |. |]. |]. |] 2625 |. Jf. |]. |) 2ajeo |]. |). fe |} teas |. fi. fe J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x47 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.cmp_le.h rD,rA,rB 

Description: 

All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. Bits of the element in general-purpose register 
rD are set if the element in rA is less than or equal to the element in rB; otherwise the 
element bits are cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] < repl(rA[15:0] <= rB[15:0]) 

rD[31:16] -— repl(rA[31:16] <= rB[31:16]) 

rD[ 47:32] <— repl(rA[47:32] <= rB[47:32]) 

rD[63:48] -— repl(rA[63:48] <= rB[63:48]) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Iv.cmp_It.b 


31 2625 2120 


opcode Oxa 
6 bits 5 bits 
Format: 


lv.cmp_lt.b rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual 


Vector Byte Elements Compare 


September 7, 2017 


Iv.cmp_lIt.b 


Less Than 
.|f. |. |) 165 Jf. |]. 1110 a7 |i. ||. |e le [fe | 
A B reserved opcode 0x48 
5 bits 5 bits 3 bits 8 bits 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. Bits of the element in general-purpose register rD are set if 
the element in rA is less than the element in rB; otherwise the element bits are cleared. 


32-bit Implementation: 
N/A 


64-bit Implementation: 


rD[ 7:0] - repl(rA[7:0] <= rB[7:0]) 
rD[15:8] << repl(rA[15:8] <= rB[15:8]) 
rD[23:16] -— repl(rA[23:16] <= rB[23:16]) 
rD[31:24] - repl(rA[31:24] <= rB[31:24]) 
rD[39:32] -— repl(rA[39:32] <= rB[39:32]) 
rD[47:40] <- repl(rA[47:40] <= rB[47:40]) 
rD[55:48] - repl(rA[55:48] <= rB[55:48]) 
rD[63:56] -— repl(rA[63:56] <= rB[63:56]) 
Exceptions: 

None 
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Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


Iv.cmp_lIt.h Compare Less Than 


Iv.cmp_It.h 


B31}. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjzo |]. |). fe |} teas |. fi. Jf. J] tate ; a7 |i. ||. He le [fe | 0 
opcode Oxa D A B reserved opcode 0x49 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.cmp_l1t.h rD,rA,rB 

Description: 

All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. Bits of the element in general-purpose register 
rD are set if the element in rA is less than the element in rB; otherwise the element bits 
are cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] < repl(rA[15:0] <= rB[15:0]) 

rD[31:16] -— repl(rA[31:16] <= rB[31:16]) 

rD[ 47:32] <— repl(rA[47:32] <= rB[47:32]) 

rD[63:48] -— repl(rA[63:48] <= rB[63:48]) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Iv.cmp_ne.b 


31 2625 ||. ||. |). || 21/20 
opcode Oxa D 
6 bits 5 bits 
Format: 


lv.cmp_ne.b rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual 


Vector Byte Elements Compare 


September 7, 2017 


Iv.cmp_ne.b 


Not Equal 
- {fe |. |) 1615 Jf. |]. 1110 a7 |i. ||. |e le [fe | 
A B reserved opcode 0x4a 
5 bits 5 bits 3 bits 8 bits 


All byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB. Bits of the element in general-purpose register rD are set if 
the two corresponding compared elements are not equal; otherwise the element bits are 


cleared. 
32-bit Implementation: 
N/A 


64-bit Implementation: 


rD[ 7:0] - repl(rA[7:0] != rB[7:0]) 
rD[15:8] < repl(rA[15:8] != rB[15:8]) 
rD[23:16] -— repl(rA[23:16] != rB[23:16]) 
rD[31:24] « repl(rA[31:24] != rB[31:24]) 
rD[39:32] <— repl(rA[39:32] != rB[39:32]) 
rD[47:40] < repl(rA[47:40] != rB[47:40]) 
rD[55:48] «- repl(rA[55:48] != rB[55:48]) 
rD[63:56] <— repl(rA[63:56] != rB[63:56]) 
Exceptions: 

None 
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Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


Iv.cmp_ne.h Iv.cmp_ne.h 


Compare Not Equal 
B31. |. |]. |]. |] 2625) |. Jf. |]. |) 2ajzo |]. |). fe |} teas |). ff. fe J] tate ; a7 |i. ||. |e |e [fe | 0 
opcode Oxa D A B reserved opcode 0x4b 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.cmp_ne.h rD,rA,rB 

Description: 

All half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB. Bits of the element in general-purpose register 
rD are set if the two corresponding compared elements are not equal; otherwise the 
element bits are cleared. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] <« repl(rA[15:0] != rB[15:0]) 

rD[31:16] -— repl(rA[31:16] != rB[31:16]) 


rD[47:32] <-— repl(rA[47:32] != rB[l47:32]) 
rD[63:48] <- repl(rA[63:48] != rB[63:48]) 


Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Reserved for Custom Vector 


Iv.cust1 ; Iv.cust1 
Instructions 
31 df. dhe |] 26125 J}. |e Je fe fe Je Te Tf Te He Te Te Efe He He - 87 . . 43 i. 0 
opcode Oxa reserved opcode Oxc reserved 
6 bits 18 bits 4 bits 4 bits 
Format: 
lv.cust1 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but instead by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORVDX64 II 
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Reserved for Custom Vector 


Iv.cust2 ; Iv.cust2 
Instructions 
31 eet ae | 2625 fe He de Je He Te He Te Te Te The Te He Te Pe TP 87 . : 43 .|l. 0 
opcode Oxa reserved opcode Oxd reserved 
6 bits 18 bits 4 bits 4 bits 
Format: 
lv.cust2 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but instead by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORVDX64 II 
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Reserved for Custom Vector 


Iv.cust3 ; Iv.cust3 
Instructions 
31 df. dhe |] 26125 J}. |e Je fe fe Je Te Tf Te He Te Te Efe He He - 87 . . 43 i. 0 
opcode Oxa reserved opcode Oxe reserved 
6 bits 18 bits 4 bits 4 bits 
Format: 
lv.cust3 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but instead by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORVDX64 II 


www.opencores.org 1.2-1 212 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Reserved for Custom Vector 


Iv.cust4 ; Iv.cust4 
Instructions 
31 {fe dpe de 2625. |e]. Te de He Te Te Te Te Te Te He Te He 87 : . 43 if. 0 
opcode Oxa reserved opcode Oxf reserved 
6 bits 18 bits 4 bits 4 bits 
Format: 
lv.cust4 
Description: 


This fake instruction only allocates instruction set space for custom instructions. Custom 
instructions are those that are not defined by the architecture but instead by the 
implementation itself. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

N/A 


Exceptions: 


N/A 


Instruction Class 
ORVDX64 II 
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Vector Half-Word Elements 


Iv.madds.h : : lv.madds.h 
Multiply Add Signed Saturated 
B31}. |]. |]. |]. |] 2625 |. Jf. |]. |) 2ajeo |. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x54 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.madds.h rD,rA,rB 
Description: 


The signed half-word elements of general-purpose register rA are multiplied by the 
signed half-word elements of general-purpose register rB to form intermediate results. 
They are then added to the signed half-word VMAC elements to form the final results 
that are placed again in the VMAC registers. The intermediate result is placed into 
general-purpose register rD. If any of the final results exceeds the min/max value, it is 
saturated. 


Note: The ORVDX instruction set is not completely specified. This instruction is 
incorrectly specified in that VMAC is not defined and implementation below does not 
match description. 
32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < sat32s(rA[15:0] * rB[15:0] + VMACLO[31:0]) 
rD[31:16] <— sat32s(rA[31:16] * rB[31:16] + VMACLO[63:32]) 
rD[ 47:32] <— sat32s(rA[47:32] * rB[47:32] + VMACHI[31:0]) 
rD[63:48] <— sat32s(rA[63:48] * rB[63:48] + VMACHI[63:32]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.max.b Vector Byte Elements Maximum 


31 


opcode Oxa 
6 bits 


Format: 


2625 


5 bits 


lv.max.b rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual 


2120 


1615 


5 bits 


5 bits 


1110 


reserved 
3 bits 


Septem 


ber 7, 2017 


lv.max.b 


opcode 0x55 
8 bits 


The byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB, and the larger elements are selected to form the result 
elements. The result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[ 7:0] 


rD[15: 
rD[23: 
rD[ 31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


Lot or th te ot 


rA[7:0] 


rA[15: 
rAl23: 
rA[31: 
rA[39: 
rA[47: 
rA[55: 
rA[63: 
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8 | 
16] 
24] 
32] 
40] 
48] 
56] 


VVVVVVVV 


rB[L 7:0] 


rB[15: 
rB[23: 
rB[ 31: 
rB[39: 
rB[ 47: 
rB[55: 
rB[63: 


Instruction Class 
ORVDX64 I 


8] 

16] 
24] 
32] 
4Q] 
48] 
56] 
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rA[7:0] 


rA[15: 
FAL 23% 
rA[31: 
rA[39: 
rA[47: 
rA[55: 
rA[63: 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


rB[7: 
rB[15 


rB[23: 
rB[ 31: 
rB[39: 
rB[ 47: 
rb 55: 
rB[63: 


0] 
:8] 
16] 
24] 
32] 
40] 
48] 
56] 


215 of 357 


OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Vector Half-Word Elements 


lv.max.h , Iv.max.h 
Maximum 
31) Jf. fi if. ey} 2625 TI de} 2tiao LTP }} teas Jf.) Fe TP ato : ai7 |}. |i. |i. ||. lel 0 
opcode Oxa D A B reserved opcode 0x56 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.max.h rD,rA,rB 

Description: 

The half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB, and the larger elements are selected to form the 
result elements. The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rA[15:0] > rB[15:0] ? rA[15:0] : rB[15:0] 
rD[31:16] < rA[31:16] > rB[31:16] ? rA[31:16] : rB[31:16] 
rD[47:32] <— rA[47:32] > rB[47:32] ? rA[47:32] : rB[47:32] 
rD[63:48] <— rA[63:48] > rB[63:48] ? rA[63:48] : rB[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.merge.b 


31 


opcode Oxa 
6 bits 


Format: 


2625 


OpenRISC 1000 Architecture Manual 


September 7, 2017 


Vector Byte Elements Merge _ lv.merge.b 


5 bits 


2120 


lv.merge.b rD,rA,rB 


Description: 


1615 


5 bits 5 bits 


1110 ; 87 Hedy. fe fhe fe Uf. 0) 


reserved opcode 0x57 
3 bits 8 bits 


The byte elements of the lower half of the general-purpose register rA are combined with 
the byte elements of the lower half of general-purpose register rB in such a way that the 
lowest element is from rB, the second element from rA, the third again from rB etc. The 
result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[7:0] 


rD[15: 
rD[23: 
rD[31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


tt tT tT tf tT T T 


rB[7:0] 


rA[i5: 
rB[23: 
rA[31: 
rB[39: 
rA[47: 
rB[55: 
rA[63: 
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16] 
24] 
32] 
40] 
48] 
56] 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


lv.merge.h lv.merge.h 
Merge 
B31. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjeo |]. |). fe |] teas |). ff. Jf. J] tate ; a7 |i. |. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x58 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.merge.h rD,rA,rB 

Description: 

The half-word elements of the lower half of the general-purpose register rA are combined 
with the half-word elements of the lower half of general-purpose register rB in such a 
way that the lowest element is from rB, the second element from rA, the third again from 
rB etc. The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rB[15:0] 
rD[31:16] <— rA[31:16] 
rD[47:32] < rB[47:32] 
rD[63:48] < rA[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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lv.min.b 


31 


opcode Oxa 
6 bits 


Format: 


2625 


Vector Byte Elements Minimum 


5 bits 


lv.min.b rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual 


2120 


1615 


5 bits 


5 bits 


1110 


reserved 
3 bits 


Septem 


ber 7, 2017 


lv.min.b 


opcode 0x59 
8 bits 


The byte elements of general-purpose register rA are compared to the byte elements of 
general-purpose register rB, and the smaller elements are selected to form the result 
elements. The result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[ 7:0] 


rD[15: 
rD[23: 
rD[ 31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


Lot or th te ot 


rA[7:0] 


rA[15: 
rAl23: 
rA[31: 
rA[39: 
rA[47: 
rA[55: 
rA[63: 


www.opencores.org 





8 | 
16] 
24] 
32] 
40] 
48] 
56] 


NN ANKANANA 


rB[L 7:0] 


rB[15: 
rB[23: 
rB[ 321: 
rB[39: 
rB[ 47: 
rB[55: 
rB[63: 


Instruction Class 
ORVDX64 I 
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16] 
24] 
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48] 
56] 
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rA[7:0] 


rA[15: 
FAL 23% 
rA[31: 
rA[39: 
rA[47: 
rA[55: 
rA[63: 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


rB[7: 
rB[15 


rB[23: 
rB[ 31: 
rB[39: 
rB[ 47: 
rb 55: 
rB[63: 
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lv.min.h Vector Half-Word Elements Minimum lv.min.h 


31 ||. |). Jf. |]. |] 26i@5 |. Jf. f]e ) 2tjao yf. fe fe |} teas |). ff. Jf. J] tate B77 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x5a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.min.h rD,rA,rB 

Description: 

The half-word elements of general-purpose register rA are compared to the half-word 
elements of general-purpose register rB, and the smaller elements are selected to form the 
result elements. The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rA[15:0] < rB[15:0] ? rA[15:0] : rB[15:0] 
rD[31:16] -— rA[31:16] < rB[31:16] ? rA[31:16] : rB[31:16] 
rD[47:32] <— rA[47:32] < rB[47:32] ? rA[47:32] : rB[47:32] 
rD[63:48] < rA[63:48] < rB[63:48] ? rA[63:48] : rB[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 
Iv.msubs.h Multiply Subtract Signed Iv.msubs.h 


Saturated 
31]. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjeo |]. |). tf. |} teas |. fi. Jf. |] tate a7 |i. ||. |]. le {le | 0 
opcode Oxa D A B reserved opcode 0x5b 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 
lv.msubs.h rD,rA,rB 
Description: 


The signed half-word elements of general-purpose register rA are multiplied by the 
signed half-word elements of general-purpose register rB to form intermediate results. 
They are then subtracted from the signed half-word VMAC elements to form the final 
results that are placed again in the VMAC registers. The intermediate result is placed into 
general-purpose register rD. If any of the final results exceeds the min/max value, it is 
saturated. 


Note: The ORVDX instruction set is not completely specified. This instruction is 
incorrectly specified in that VMAC is not defined and implementation below does not 
match description. 
32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < sat32s(VMACLO[31:0] - rA[15:0] * rB[15:0]) 
rD[31:16] — sat32s(VMACLO[63:32] - rA[31:16] * rB[31:16]) 
rD[ 47:32] <— sat32s(VMACHI[31:0] - rA[47:32] * rB[47:32]) 
rD[63:48] — sat32s(VMACHI[63:32] - rA[63:48] * rB[63:48]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 


www.opencores.org 1.2-1 221 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Vector Half-Word Elements Multiply 


Iv.muls.h ‘ Iv.muls.h 
Signed Saturated 
B31. |]. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). i. |} teas |. ff. Jf. |] tate ; a7 |f. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x5c 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.muls.h rD, rA, rB 

Description: 

The signed half-word elements of general-purpose register rA are multiplied by the 
signed half-word elements of general-purpose register rB to form the results. The result is 
placed into general-purpose register rD. If any of the final results exceeds the min/max 
value, it is saturated. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] < sati6s(rA[15:0] * rB[15:0]) 

rD[31:16] - sati6s(rA[31:16] * rB[31:16]) 

rD[ 47:32] -— sati6s(rA[47:32] * rB[47:32]) 

rD[63:48] -— sati6s(rA[63:48] * rB[63:48] ) 

Exceptions: 


None 


Instruction Class 
ORVDX64 II 
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Iv.nand 
31 |. |}. J. fi. || 26/25 
opcode Oxa D 
6 bits 5 bits 
Format: 


lv.nand rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual September 7, 2017 
Vector Not And Iv.nand 
21/20 =|. |. ||. 1615 3... 1110 ; 87 Hel. fe fhe fe |. 0 
A B reserved opcode 0x5d 
5 bits 5 bits 3 bits 8 bits 


The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical NAND operation. The result is placed into 


general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[63:0] ~— rA[63:0] NAND rB[63:0] 


Exceptions: 


None 
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Instruction Class 
ORVDX64 I 
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lv.nor Vector Not Or lv.nor 
31) Jf df. ey} 262s TI ey} 2aiao Tey} gels yf.) Fe TP aato : aly |}. |i. |i. fet) 0 
opcode Oxa D A B reserved opcode 0x5e 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.nor rD,rA,rB 

Description: 

The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical NOR operation. The result is placed into general- 
purpose register rD. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] < rA[63:0] NOR rB[63:0] 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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lv.or 
31 ||. ||. J. |i. || 26/25 
opcode Oxa D 
6 bits 5 bits 
Format: 


lv.or rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual September 7, 2017 
Vector Or lv.or 
21/20 =|]. |. ||. 1615 ||. ||. ||. 1110 ; 87 lf. i]. fe {fe fle |. 0 
A B reserved opcode Ox5f 
5 bits 5 bits 3 bits 8 bits 


The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical OR operation. The result is placed into general- 


purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[63:0] < rA[63:0] OR rB[63:0] 


Exceptions: 


None 
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Instruction Class 
ORVDX64 I 
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Iv.pack.b Vector Byte Elements Pack Iv.pack.b 


31 ||. |). Jf. |]. |] 26@5 ff. Jf. |}. |) 2tjeo yf. |). fe |} teas |). ff. ]f. |] tate a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x60 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.pack.b rD,rA, rB 

Description: 

The lower half of the byte elements of the general-purpose register rA are truncated and 
combined with the lower half of the byte truncated elements of the general-purpose 
register rB in such a way that the lowest elements are from rB, and the highest elements 


from rA. The result elements are placed into general-purpose register rD. 


64-bit Implementation: 


rD[3:0] < rB[3:0] 
rD[ 7:4] - rB[11:8] 
rD[11:8] < rB[19:16] 
rD[15:12] < rB[27:24] 
rD[19:16] - rB[35:32] 
rD[ 23:20] < rB[43:40] 
rD[27:24] < rB[51:48] 
rD[31:28] - rB[59:56] 
rD[35:32] < rA[3:0] 
rD[39:36] < rA[11:8] 
rD[43:40] - rA[19:16] 
rD[47:44] < rA[27:24] 
rD[51:48] <— rA[35:32] 
rD[55:52] - rA[43:40] 
rD[59:56] <- rA[51:48] 
rD[63:60] < rA[59:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.pack.h Vector Half-word Elements Pack _ lv.pack.h 


31 


opcode Oxa 
6 bits 


Format: 


2625 


2120 


5 bits 


lv.pack.h rD,rA, rB 


Description: 


5 bits 


1615 


5 bits 


1110 


reserved opcode 0x61 
3 bits 8 bits 


The lower half of the half-word elements of the general-purpose register rA are truncated 
and combined with the lower half of the half-word truncated elements of the general- 
purpose register rB in such a way that the lowest elements are from rB, and the highest 
elements from rA. The result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[7:0] 


rD[15: 
rD[23: 
rD[31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 
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16] 
24] 
32] 
40] 
48] 
56] 


tt tt tt Tt ft 





rB[7:0] 

rB[23:16] 
rB[39:32] 
rB[55:48] 
rA[7:0] 

rA[23:16] 
rA[39:32] 
rA[55:48] 


Instruction Class 
ORVDX64 I 
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Iv.packs. 


31 


opcode Oxa 
6 bits 


Format: 


2625 


OpenRISC 1000 Architecture Manual 


2120 


5 bits 


lv.packs.b rD,rA,rB 


Description: 


b Vector Byte Elements Pack Signed 


September 7, 2017 


lv.packs.b 
Saturated P 
. if. fe 1615 87 fe. |e df. fle | 0 
A B opcode 0x62 
5 bits 5 bits 8 bits 


The lower half of the signed byte elements of the general-purpose register rA are 
truncated and combined with the lower half of the signed byte truncated elements of the 
general-purpose register rB in such a way that the lowest elements are from rB, and the 
highest elements from rA. If any truncated element exceeds a signed 4-bit value, it is 
saturated. The result elements are placed into general-purpose register rD. 


64-bit Implementation: 


rD[3:0] 
rD[ 7:4] 


rD[ 14: 
rD[15: 
rD[19: 
rD[23: 
rD[27: 
rD[ 31: 
rD[35: 
rD[39: 
rD[43: 
rD[47: 
rD[ 51: 
rD[55: 
rD[59: 
rD[63: 


Exceptions: 


None 


8] 

12] 
16] 
20] 
24] 
28] 
32] 
36] 
40] 
44] 
48] 
52] 
56] 
60] 


Tt tT tT tT Tt tt tt tt t tT 


sat4s(rB[7:0]) 


sat4s(rB[15: 
sat4s(rB[23: 
sat4s(rB[31: 
sat4s(rB[39: 
sat4s(rB[47: 
sat4s(rB[55: 
sat4s(rB[63: 


56 


sat4s(rA[7:0]) 


sat4s(rA[15: 
sat4s(rA[23: 
sat4s(rA[31: 
sat4s(rA[39: 
sat4s(rA[47: 
sat4s(rA[55: 
sat4s(rA[63: 
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ORVDX64 I 
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Iv.packs.h 


31 


opcode Oxa 
6 bits 


Format: 


2625 


OpenRISC 1000 Architecture Manual 


Vector Half-word Elements Pack 
Signed Saturated 


2120 


5 bits 


lv.packs.h rD,rA,rB 


Description: 


5 bits 


1615 


5 bits 


September 7, 2017 


Iv.packs.h 


opcode 0x63 
8 bits 


The lower half of the signed halfword elements of the general-purpose register rA are 
truncated and combined with the lower half of the signed half-word truncated elements of 
the general-purpose register rB in such a way that the lowest elements are from rB, and 
the highest elements from rA. If any truncated element exceeds a signed 8-bit value, it is 
saturated. The result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[7:0] 


rD[15: 
rD[23: 
rD[31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


tf ? Tt t tT 


sat8s(rB[15: 
sat8s(rB[31: 
sat8s(rB[47: 
sat8s(rB[63: 
sat8s(rA[15: 
sat8s(rA[31: 
sat8s(rA[47: 
sat8s(rA[63: 
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0]}) 
16] ) 
32]) 
48]) 


16]) 
32]) 
48]) 


Instruction Class 
ORVDX64 I 
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Iv.packus.b 


31 ae hse ee Hil 2625 


opcode Oxa 
6 bits 


Format: 
lv.packus.b 


Description: 


OpenRISC 1000 Architecture Manual 


Vector Byte Elements Pack 


Unsigned Saturated 


2120 


5 bits 


rD, rA, rB 


1615 


5 bits 


5 bits 


1110 


reserved 
3 bits 


September 7, 2017 


Iv.packus.b 
87 Ife. |e fe He | 0 
opcode 0x64 
8 bits 


The lower half of the unsigned byte elements of the general-purpose register rA are 
truncated and combined with the lower half of the unsigned byte truncated elements of 
the general-purpose register rB in such a way that the lowest elements are from rB, and 
the highest elements from rA. If any truncated element exceeds an unsigned 4-bit value, it 
is saturated. The result elements are placed into general-purpose register rD. 


64-bit Implementation: 


rD[3:0] 

rD[ 7:4] 

rD[11:8] 
rD[15:12] 
rD[19:16] 
rD[23:20] 
rD[ 27:24] 
rD[ 31:28] 
rD[35:32] 
rD[39:36] 
rD[ 43:40] 
rD[ 47:44] 
rD[51:48] 
rD[55:52] 
rD[59:56| 
rD[ 63:60] 


Tt Tt tT tT tt tt tt tt t tT 


Exceptions: 


None 


sat4u(rB[7:0]) 


sat4u(rB[15: 
sat4u(rB[23: 
sat4u(rB[31: 
sat4u(rB[39: 
sat4u(rB[47: 
sat4u(rB[55: 
sat4u(rB[63: 


56 


sat4u(rA[7:0]) 


sat4u(rA[15: 
sat4u(rA[23: 
sat4u(rA[31: 
sat4u(rA[39: 
sat4u(rA[47: 
sat4u(rA[55: 
sat4u(rA[63: 
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Iv.packus.h 


31 


opcode Oxa 
6 bits 


Format: 


2625 


OpenRISC 1000 Architecture Manual 


Vector Half-word Elements Pack 
Unsigned Saturated 


2120 


5 bits 


lv.packus.h rD,rA,rB 


Description: 


5 bits 


1615 


5 bits 


September 7, 2017 


Iv.packus.h 
87 fe. |e fe le | 0 
opcode 0x65 
8 bits 


The lower half of the unsigned halfword elements of the general-purpose register rA are 
truncated and combined with the lower half of the unsigned half-word truncated elements 
of the general-purpose register rB in such a way that the lowest elements are from rB, and 
the highest elements from rA. If any truncated element exceeds an unsigned 8-bit value, it 
is saturated. The result elements are placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[7:0] 


rD[15: 
rD[23: 
rD[31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8 | 
16] 
24] 
32] 
40] 
48] 
56] 


tf tT tf t tT 


sat8u(rB[15: 
sat8u(rB[31: 
sat8u(rB[47: 
sat8u(rB[63: 
sat8u(rA[15: 
sat8u(rA[31: 
sat8u(rA[47: 
sat8u(rA[63: 
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0]}) 
16] ) 
32]) 
48]) 


16]) 
32]) 
48]) 


Instruction Class 
ORVDX64 I 
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Iv.perm.n Vector Nibble Elements Permute lv.perm.n 


31 


opcode Oxa 
6 bits 


Format: 


2625 


2120 


5 bits 


lv.perm.n rD,rA, rB 


Description: 


5 bits 


1615 


1110 


5 bits 


opcode 0x66 
8 bits 


The 4-bit elements of general-purpose register rA are permuted according to the 
corresponding 4-bit values in general-purpose register rB. The result elements are placed 
into general-purpose register rD. 


64-bit Implementation: 


rD[3:0] 
rD[ 7:4] 


rD[11: 
rD[15: 
rD[19: 
rD[23: 
rD[27: 
rD[ 31: 
rD[35: 
rD[39: 
rD[ 43: 
rD[47: 
rD[51: 
rD[55: 
rD[59: 
rD[63: 


Exceptions: 


None 


8] 

12] 
16] 
20] 
24] 
28] 
32] 
36] 
40] 
44] 
48] 
52] 
56] 
60] 


- rA[rB[3:0]*4+3:rB[3:0]*4] 
rA[rB[7:4]*4+3:rB[7:4]*4] 
8]*4+3:rB[11:8]*4] 


tT PPT Port Tort Pot 7 Tt 


rA[rB[11: 
rA[rB[15: 
rA[rB[19: 
rA[rB[23: 
rA[rB[27: 
rA[rB[321: 
rA[rB[35: 
rA[rB[39: 
rA[rB[43: 
rA[rB[47: 
rA[rB[51: 
rA[rB[55: 
rA[rB[59: 
rA[rB[63: 
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12]*4+3: 
16]*4+3: 
20]*4+3: 
24]*4+3: 
28]*4+3: 
32]*4+3: 
36]*4+3: 
4Q]*4+3: 
44]*44+3: 
48]*4+3: 
52]*44+3: 
56]*4+3: 
60]*4+3: 


rB[15: 
rB[19: 
rB[23: 
rB[27: 
rB[ 31: 
rB[35: 
rB[39: 
rB[43: 
rB[ 47: 
rB[51: 
rB[55: 
rB[59: 
rB[63: 


12]*4] 
16]*4] 
20]*4] 
24]*4] 
28]*4] 
32]*4] 
36]*4] 
40]*4] 
44]*4] 
48]*4] 
52]*4] 
56]*4] 
60]*4] 


Instruction Class 
ORVDX64 I 
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lv.rl.b Vector Byte Elements Rotate Left Iv.rl.b 
B31}. |]. Jf. |]. |] 2625 |. Jf. |]. |) 2ajeo fe |). fe |} teas |). ff. Jf. |] tate a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x67 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.rl.b rD,rA,rB 

Description: 

The contents of byte elements of general-purpose register rA are rotated left by the 
number of bits specified in the lower 3 bits in each byte element of general-purpose 
register rB. The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[ 7:0] «- rA[7:0] rl rB[2:0] 
rD[15:8] < rA[15:8] rl rB[10:8] 
rD[23:16] < rA[23:16] rl rB[18:16] 
rD[31:24] <— rA[31:24] rl rB[26:24] 
rD[39:32] -— rA[39:32] rl rB[34:32] 
rD[47:40] < rA[47:40] rl rB[42:40] 
rD[55:48] < rA[55:48] rl rB[50:48] 
rD[63:56] -— rA[63:56] rl rB[58:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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lv.rl.h Vector Half-Word Elements Rotate Left lv.rl.h 


B31 ||. |). J]. |]. |] 2625 |. Jf. |]. |) 2tjeo |]. |). fe |} teas |). ff. Jf. |] tate a7 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x68 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.rl.h rD,rA,rB 

Description: 

The contents of half-word elements of general-purpose register rA are rotated left by the 
number of bits specified in the lower 4 bits in each half-word element of general-purpose 
register rB. The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] «< rA[15:0] rl rB[3:0] 
rD[31:16] - rA[31:16] rl rB[19:16] 
rD[47:32] < rA[47:32] rl rB[35:32] 
rD[63:48] <— rA[63:48] rl rB[51:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.sll Vector Shift Left Logical lv.sll 
B31 ||. |. |]. |]. |] 26125 |. Jf. fy}. |) 2tjeo if. |). fe |} teas |). ff. Jf. J] tate a7 |i. ||. |e le |e | 0 
opcode Oxa D A B reserved opcode 0x6b 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 

Format: 


lv.sll rD,rA,rB 

Description: 

The contents of general-purpose register rA are shifted left by the number of bits 
specified in the lower 4 bits in each byte element of general-purpose register rB, inserting 


zeros into the low-order bits of rD. The result elements are placed into general-purpose 
register rD. 


Note: The ORVDX instruction set is not completely specified. This instruction is 
incorrectly specified in that implementation below does not operate in a vector fashion 
and no element size is specified in the mnemonic. It may be a remnant of a template or 
lv.sll.b. 
32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] < rA[63:0] << rB[2:0] 
Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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lv.sll.b Vector Byte Elements Shift Left Logical lv.sll.b 


31. |/. Jf. |]. |] 26i@5 fe Jf. |}. |) 2tjao ff). df. |} teas |). ff. fe |] tate B77 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x69 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.sll.b rD,rA,rB 

Description: 

The contents of byte elements of general-purpose register rA are shifted left by the 
number of bits specified in the lower 3 bits in each byte element of general-purpose 
register rB, inserting zeros into the low-order bits. The result elements are placed into 
general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] - rA[7:0] << rB[2:0] 
rD[15:8] < rA[15:8] << rB[10:8] 
rD[ 23:16] <— rA[23:16] << rB[18:16] 
rD[31:24] <- rA[31:24] << rB[26:24] 
rD[ 39:32] <— rA[39:32] << rB[34:32] 
rD[47:40] <— rA[47:40] << rB[42:40] 
rD[55:48] <- rA[55:48] << rB[50:48] 
rD[63:56] <— rA[63:56] << rB[58:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Shift Left 


Iv.sil.h : Iv.sll.h 
Logical 
B31. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjzo |]. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode Ox6a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.sll.h rD,rA,rB 

Description: 

The contents of half-word elements of general-purpose register rA are shifted left by the 
number of bits specified in the lower 4 bits in each half-word element of general-purpose 
register rB, inserting zeros into the low-order bits. The result elements are placed into 
general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rA[15:0] << rB[3:0] 
rD[31:16] <— rA[31:16] << rB[19:16] 
rD[ 47:32] <— rA[47:32] << rB[35:32] 
rD[63:48] <— rA[63:48] << rB[51:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Shift Right 


lv.sra.b ; : lv.sra.b 
Arithmetic 
31) |. fief. He} 262s Td). ]} 2tiao TL Tey} teas Jf.) Fe TF ato : aly i. |i |i. |. te 0 
opcode Oxa D A B reserved opcode Ox6e 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.sra.b rD,rA,rB 

Description: 

The contents of byte elements of general-purpose register rA are shifted right by the 
number of bits specified in the lower 3 bits in each byte element of general-purpose 
register rB, inserting the most significant bit of each element into the high-order bits. The 
result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[ 7:0] «- rA[7:0] sra rB[2:0] 
rD[15:8] < rA[15:8] sra rB[10:8] 
rD[23:16] -— rA[23:16] sra rB[18:16] 
rD[31:24] < rA[31:24] sra rB[26:24] 
rD[39:32] <— rA[39:32] sra rB[34:32] 
rD[ 47:40] <- rA[47:40] sra rB[ 42:40] 
rD[55:48] < rA[55:48] sra rB[50:48] 
rD[63:56] - rA[63:56] sra rB[58:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Shift Right 


Iv.sra.h : : lv.sra.h 
Arithmetic 
B31}. |). |]. |]. |] 26125 |. Jf. |]. |) 2tjao |]. |). fe |} teas |. ff. fe J] tate ; B77 |f. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode Ox6f 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.sra.h rD,rA,rB 

Description: 

The contents of half-word elements of general-purpose register rA are shifted right by the 
number of bits specified in the lower 4 bits in each half-word element of general-purpose 
register rB, inserting the most significant bit of each element into the high-order bits. The 
result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] <« rA[15:0] sra rB[3:0] 
rD[31:16] - rA[31:16] sra rB[19:16] 
rD[ 47:32] <— rA[47:32] sra rB[35:32] 
rD[63:48] - rA[63:48] sra rB[51:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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lv.srl Vector Shift Right Logical lv.srl 
B31 ||. |]. |]. |]. |] 2625 |. Jf. |]. |) 2tjeo Jf. |). fe |} teas |). ff. J. |] tate B77 if. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x70 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.srl rD,rA,rB 

Description: 

The contents of general-purpose register rA are shifted right by the number of bits 
specified in the lower 4 bits in each byte element of general-purpose register rB, inserting 


zeros into the high-order bits of rD. The result elements are placed into general-purpose 
register rD. 


Note: The ORVDX instruction set is not completely specified. This instruction is 
incorrectly specified in that implementation below does not operate in a vector fashion 
and no element size is specified in the mnemonic. It may be a remnant of a template or 
lv.srl.b. 
32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[63:0] < rA[63:0] >> rB[2:0] 
Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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lv.srl.b Vector Byte Elements Shift Right Logical lv.srl.b 


31 ||. |). Jf. |]. |] 26@5 |. Jf. |}. |) 2tjao yf. fe fe |} teas |). ff. fe J] tate a7 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x6c 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.srl.b rD,rA,rB 

Description: 

The contents of byte elements of general-purpose register rA are shifted right by the 
number of bits specified in the lower 3 bits in each byte element of general-purpose 
register rB, inserting zeros into the high-order bits. The result elements are placed into 
general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] - rA[7:0] >> rB[2:0] 
rD[15:8] < rA[15:8] >> rB[10:8] 
rD[ 23:16] <— rA[23:16] >> rB[18:16] 
rD[31:24] <- rA[31:24] >> rB[26:24] 
rD[ 39:32] <— rA[39:32] >> rB[34:32] 
rD[47:40] <- rA[47:40] >> rB[42:40] 
rD[55:48] <- rA[55:48] >> rB[50:48] 
rD[63:56] <— rA[63:56] >> rB[58:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Shift Right " 


Iv.srl.h ‘ v.srl.h 
Logical 
B31. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjzo |]. |). fe |} teas |. ff. Jf. J] tate ; a7 |i. ||. He le [fe | 0 
opcode Oxa D A B reserved opcode Ox6d 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.srl.h rD,rA,rB 

Description: 

The contents of half-word elements of general-purpose register rA are shifted right by the 
number of bits specified in the lower 4 bits in each half-word element of general-purpose 
register rB, inserting zeros into the high-order bits. The result elements are placed into 
general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rA[15:0] >> rB[3:0] 
rD[31:16] < rA[31:16] >> rB[19:16] 
rD[ 47:32] <— rA[47:32] >> rB[35:32] 
rD[63:48] <— rA[63:48] >> rB[51:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Iv.sub.b Vector Byte Elements Subtract Signed lv.sub.b 


31 


opcode Oxa 
6 bits 


Format: 


2625 


5 bits 


lv.sub.b rD,rA,rB 


Description: 


2120 


1615 


5 bits 


1110 


opcode 0x71 
8 bits 


The byte elements of general-purpose register rB are subtracted from the byte elements of 
general-purpose register rA to form the result elements. The result elements are placed 


into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[ 7:0] 


rD[15: 
rD[23: 
rD[ 31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8] 

16] 
24] 
32) 
4Q] 
48] 
56] 


Lot or th te ot 


rA[7:0] 


rA[15: 
rAl23: 
rA[31: 
rA[39: 
rA[47: 
rA[55: 
rA[63: 
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8] 

16] 
24] 
32] 
4Q] 
48] 
56] 


rB[L 7:0] 


rB[15: 
rB[23: 
rB[ 31: 
rB[39: 
rB[ 47: 
rB[55: 
rB[63: 


Instruction Class 
ORVDX64 I 
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16] 
24] 
32] 
4Q] 
48] 
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Vector Half-Word Elements Subtract 
lv.sub.h 


: lv.sub.h 
Signed 
B31}. |. |]. |]. |] 2625 |. Jf. |}. |) 2tjzo |]. |). fe |} teas |). ff. Jf. J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x72 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.sub.h rD,rA,rB 

Description: 

The half-word elements of general-purpose register rB are subtracted from the half-word 
elements of general-purpose register rA to form the result elements. The result elements 
are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rA[15:0] - rB[15:0] 
rD[31:16] -— rA[31:16] - rB[31:16] 
rD[47:32] <— rA[47:32] - rB[47:32] 
rD[63:48] <— rA[63:48] - rB[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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b Vector Byte Elements Subtract Signed icconhe 


Iv.subs. 
Saturated 
B31. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjeo |]. |). fe |} teas |. ff. Jf. |] tate ; a7 |i. |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x73 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 


Format: 

lv.subs.b rD,rA, rB 

Description: 

The byte elements of general-purpose register rB are subtracted from the byte elements of 
general-purpose register rA to form the result elements. If the result exceeds the min/max 
value for the destination data type, it is saturated to the min/max value and placed into 
general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] - sat8s(rA[7:0] - rB[7:0]) 
rD[15:8] < sat8s(rA[15:8] - rB[15:8]) 
rD[23:16] -— sat8s(rA[23:16] - rB[23:16]) 
rD[31:24] -— sat8s(rA[31:24] - rB[31:24]) 
rD[39:32] <— sat8s(rA[39:32] - rB[39:32]) 
rD[47:40] - sat8s(rA[47:40] - rB[47:40]) 
rD[ 55:48] - sat8s(rA[55:48] - rB[55:48]) 
rD[63:56] <— sat8s(rA[63:56] - rB[63:56]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 


www.opencores.org 1.2-1 245 of 357 





OpenCores OpenRISC 1000 Architecture Manual September 7, 2017 


Vector Half-Word Elements Subtract 
lv.subs.h 


: Iv.subs.h 
Signed Saturated 
B31. |]. |]. |]. |] 2625 |. Jf. |}. |) 2tjeo |]. |). i. |} teas |. ff. Jf. |] tate ; a7 lef. He le [fe | 0 
opcode Oxa D A B reserved opcode 0x74 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.subs.h rD,rA, rB 

Description: 

The half-word elements of general-purpose register rB are subtracted from the half-word 
elements of general-purpose register rA to form the result elements. If the result exceeds 
the min/max value for the destination data type, it is saturated to the min/max value and 
placed into general-purpose register rD. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] <« sati6s(rA[15:0] - rB[15:0]) 

rD[31:16] - sati6s(rA[31:16] - rB[31:16]) 

rD[ 47:32] -— sati6s(rA[47:32] - rB[47:32]) 

rD[63:48] -— sati6s(rA[63:48] - rB[63:48] ) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Subtract 


Iv.subu.b : Iv.subu.b 
Unsigned 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajeo |]. |). i. |} teas |. ff. Jf. J] tate ; B77 |f. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x75 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.subu.b rD,rA, rB 

Description: 

The unsigned byte elements of general-purpose register rB are subtracted from the 
unsigned byte elements of general-purpose register rA to form the result elements. The 
result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] <rA[7:0] - rB[7:0] 
rD[15:8] << rA[15:8] - rB[15:8] 
rD[23:16] <— rA[23:16] - rB[23:16] 
rD[31:24] <— rA[31:24] - rB[31:24] 
rD[39:32] -— rA[39:32] - rB[39:32] 
rD[ 47:40] < rA[47:40] - rB[47:40] 
rD[55:48] < rA[55:48] - rB[55:48] 
rD[63:56] -— rA[63:56] - rB[63:56] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements Subtract 


Iv.subu.h , Iv.subu.h 
Unsigned 
B31}. |. |]. |]. |] 26125 |. Jf. |}. |) 2ajeo |]. |). i. |} teas |. ff. Jf. J] tate ; a7 |i. |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x76 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.subu.h rD,rA, rB 

Description: 

The unsigned half-word elements of general-purpose register rB are subtracted from the 
unsigned half-word elements of general-purpose register rA to form the result elements. 
The result elements are placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[15:0] < rA[15:0] - rB[15:0] 
rD[31:16] -— rA[31:16] - rB[31:16] 
rD[47:32] <— rA[47:32] - rB[47:32] 
rD[63:48] <— rA[63:48] - rB[63:48] 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Byte Elements Subtract 


Iv.subus.b . Iv.subus.b 
Unsigned Saturated 
B31. |. |]. |]. |] 26125 |. Jf. |]. |) 2tjzo |]. |). |. |} teas |). ff. fe J] tate ; a7 fe |. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x77 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.subus.b rD,rA,rB 

Description: 

The unsigned byte elements of general-purpose register rB are subtracted from the 
unsigned byte elements of general-purpose register rA to form the result elements. If the 
result exceeds the min/max value for the destination data type, it is saturated to the 
min/max value and placed into general-purpose register rD. 

32-bit Implementation: 

N/A 


64-bit Implementation: 


rD[7:0] - sat8u(rA[7:0] - rB[7:0]) 
rD[15:8] < sat8u(rA[15:8] - rB[15:8]) 
rD[ 23:16] -— sat8u(rA[23:16] - rB[23:16]) 
rD[31:24] -— sat8u(rA[31:24] - rB[31:24]) 
rD[39:32] <— sat8u(rA[39:32] - rB[39:32]) 
rD[47:40] - sat8u(rA[47:40] - rB[47:40]) 
rD[ 55:48] -— sat8u(rA[55:48] - rB[55:48]) 
rD[63:56] <— sat8u(rA[63:56] - rB[63:56]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


Iv.subus.h ; Iv.subus.h 
Subtract Unsigned Saturated 
B31. |. |]. |]. |] 2625 |. Jf. |]. |) 2tjeo |]. |). fe |} teas |. fi. fe J] tate ; a7 |i. ||. |e le {fe | 0 
opcode Oxa D A B reserved opcode 0x78 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.subus.h rD,rA,rB 

Description: 

The unsigned half-word elements of general-purpose register rB are subtracted from the 
unsigned half-word elements of general-purpose register rA to form the result elements. 
If the result exceeds the min/max value for the destination data type, it is saturated to the 
min/max value and placed into general-purpose register rD. 

32-bit Implementation: 

N/A 

64-bit Implementation: 

rD[15:0] <« sati6u(rA[15:0] - rB[15:0]) 

rD[31:16] -— sati6u(rA[31:16] - rB[31:16]) 

rD[47:32] -— sati6u(rA[47:32] - rB[47:32]) 

rD[63:48] -— sati6u(rA[63:48] - rB[63:48]) 

Exceptions: 


None 


Instruction Class 
ORVDX64 I 
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Iv.unpack.b Vector Byte Elements Unpack lv.unpack.b 


31 


opcode Oxa 
6 bits 


Format: 


2625 


21/20 =|. |. ||. 1615 3... 1110 ; 87 Tey. df. dhe le |. 0 


D A B reserved opcode 0x79 
5 bits 5 bits 5 bits 3 bits 8 bits 


lv.unpack.b rD,rA,rB 


Description: 


The lower half of the 4-bit elements in general-purpose register rA are sign-extended and 
placed into general-purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[7:0] 


rD[15: 
rD[23: 
rD[31: 
rD[39: 
rD[47: 
rD[55: 
rD[63: 


Exceptions: 


None 


8] 

16] 
24] 
32] 
40] 
48] 
56] 


tf 1 TT t tt Tt 


exts(rA[3:0]) 

exts(rA[7:4]) 

exts(rA[11:8]) 
exts(rA[15:12]) 
exts(rA[19:16] ) 
exts(rA[23:20] ) 
exts(rA[27:24]) 
exts(rA[31:28] ) 


Instruction Class 
ORVDX64 I 
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Vector Half-Word Elements 


Iv.unpack.h lv.unpack.h 
Unpack 
B31}. |. |]. |]. |] 26125 |. Jf. |). |) 2tjzo |]. |). fe |} teas |. fi. Jf. J] tate ; a7 |i. ||. |e le [fe | 0 
opcode Oxa D A B reserved opcode 0x7a 
6 bits 5 bits 5 bits 5 bits 3 bits 8 bits 
Format: 


lv.unpack.h rD,rA,rB 
Description: 


The lower half of the 8-bit elements in general-purpose register rA are sign-extended and 
placed into general-purpose register rD. 


32-bit Implementation: 
N/A 


64-bit Implementation: 


rD[15:0] < exts(rA[7:0]) 
rD[31:16] - exts(rA[15:8] ) 
rD[47:32] < exts(rA[23:16]) 
rD[63:48] < exts(rA[31:24]) 
Exceptions: 

None 


Instruction Class 
ORVDX64 I 
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lv.xor 
31 ||. |}. J]. |i. || 26/25 
opcode Oxa D 
6 bits 5 bits 
Format: 


lv.xor rD,rA,rB 


Description: 


OpenRISC 1000 Architecture Manual September 7, 2017 
Vector Exclusive Or Iv.xor 
21/20 ||. |. ||. 1615 ||. ||. ||. 1110 ; 87 |i. |). ||. {fe fle |. 0 
A B reserved opcode 0x7b 
5 bits 5 bits 3 bits 8 bits 


The contents of general-purpose register rA are combined with the contents of general- 
purpose register rB in a bit-wise logical XOR operation. The result is placed into general- 


purpose register rD. 


32-bit Implementation: 


N/A 


64-bit Implementation: 


rD[63:0] < rA[63:0] XOR rB[63:0] 


Exceptions: 


None 
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Instruction Class 
ORVDX64 I 
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6 Exception Model 


This chapter describes the various exception types and their handling. 


6.1 Introduction 


The exception mechanism allows the processor to change to supervisor state as a 
result of external signals, errors, or unusual conditions arising in the execution of 
instructions. When exceptions occur, information about the state of the processor is saved 
to certain registers and the processor begins execution at the address predetermined for 
each exception. Processing of exceptions begins in supervisor mode. 


The OpenRISC 1000 arcitecture has special support for fast exception processing — 
also called fast context switch support. This allows very rapid interrupt processing. It is 
achieved with shadowing general-purpose and some special registers. 


The architecture requires that all exceptions be handled in strict order with respect 
to the instruction stream. When an instruction-caused exception is recognized, any 
unexecuted instructions that appear earlier in the instruction stream are required to 
complete before the exception is taken. 


Exceptions can occur while an exception handler routine is executing, and multiple 
exceptions can become nested. Support for fast exceptions allows fast nesting of 
exceptions until all shadowed registers are used. If context switching is not implemented, 
nested exceptions should not occur. 


6.2 Exception Classes 


All exceptions can be described as precise or imprecise and either synchronous or 
asynchronous. Synchronous exceptions are caused by instructions and asynchronous 
exceptions are caused by events external to the processor. 

















Type Exception 
Asynchronous/nonmaskable Bus Error, Reset 
Asynchronous/maskable External Interrupt, Tick Timer 
Synchronous/precise Instruction-caused exceptions 
Synchronous/imprecise None 

















Table 6-1. Exception Classes 


Whenever an exception occurs, current PC is saved to current EPCR and new PC 
is set with the vector address according to Table 6-2. 














Exception Type Vector Causal Conditions 
Offset 
Reset 0x100 Caused by software or hardware reset. 
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Exception Type Vector Causal Conditions 
Offset 
Bus Error 0x200 The causes are implementation-specific, but typically 
they are related to bus errors and attempts to access 
invalid physical address. 
Data Page Fault 0x300 No matching PTE found in page tables or page 
protection violation for load/store operations. 
Instruction Page Fault 0x400 No matching PTE found in page tables or page 
protection violation for instruction fetch. 

Tick Timer 0x500 Tick timer interrupt asserted. 

Alignment 0x600 Load/store access to naturally not aligned location. 
Illegal Instruction 0x700 Illegal instruction in the instruction stream. 
External Interrupt 0x800 External interrupt asserted. 

D-TLB Miss 0x900 No matching entry in DTLB (DTLB miss). 

I-TLB Miss OxA00 No matching entry in ITLB (ITLB miss). 

Range OxBOO If programmed in the SR, the setting of certain flags, 
like SR[OV], causes a range exception. On OpenRISC 
implementations with less than 32 GPRs when 
accessing unimplemented architectural GPRs. On all 
implementations if SR[CID] had to go out of range in 
order to process next exception. 
System Call OxCO00 System call initiated by software. 
Floating Point OxDO00 Caused by floating point instructions when FPCSR 
status flags are set by FPU and FPCSR[FPEE] is set 
Trap OxE0O Caused by the I.trap instruction or by debug unit. 
Reserved OxFOO — Reserved for future use. 
0x1400 

Reserved 0x1500 — Reserved for implementation-specific exceptions. 

0x1800 

Reserved 0x1900 — Reserved for custom exceptions. 

0x1F0O 
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Table 6-2. Exception Types and Causal Conditions 
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6.3 Exception Processing 


Whenever an exception occurs, the current/next PC is saved to the current EPCR. 
If the CPU implements delay-slot execution (CPUCFGRI[ND] is not set) and the PC 
points to the delay-slot instruction, PC-4 is saved to the current EPCR and SR[DSX] is 
set. Table 6-3 defines what are current/next PC and effective address. 


The SR is saved to the current ESR. 
Current EPCR/ESR are identified by SR[CID]. If fast context switching is not 
implemented then current EPCR/ESR are always EPCRO/ESRO. 


In addition, the current EEAR is set with the effective address in question if one of 
the following exceptions occurs: Bus Error, IMMU page fault, DMMU page fault, 
Alignment, I-TLB miss, D-TLB miss. 


August 20, 2017 






































Exception Priority EPCR EPCR EEAR 
(no delay slot) (delay slot) 
Reset 1 - - - 
Bus Error 4 (insn) | Address of instruction | Address of jump instruction Load/ 
9 (data) | that caused exception before the instruction that store/fetch 
caused exception virtual EA 
Data Page 8 Address of instruction | Address of jump instruction | Load/store 
Fault that caused exception | before the instruction that virtual EA 
caused exception 
Instruction 3 Address of instruction | Address of jump instruction | Instruction 
Page Fault that caused exception | before the instruction that fetch 
caused exception virtual EA 
Tick Timer 12 Address of next not Address of just executed - 
executed instruction jump instruction 
Alignment 6 Address of instruction | Address of jump instruction | Load/store 
that caused exception | before the instruction that virtual EA 
caused exception 
Illegal 5 Address of instruction | Address of jump instruction | Instruction 
Instruction that caused exception | before the instruction that fetch 
caused exception virtual EA 
External 12 Address of next not Address of just executed - 
Interrupt executed instruction jump instruction 
D-TLB Miss 7 Address of instruction | Address of jump instruction | Load/store 
that caused exception | before the instruction that virtual EA 
caused exception 
I-TLB Miss 2 Address of instruction | Address of jump instruction | Instruction 
that caused exception | before the instruction that fetch 
caused exception virtual EA 
Range 10 Address of instruction | Address of jump instruction - 
that caused exception | before the instruction that 
caused exception 
System Call 7 Address of next not Address of just executed - 
executed instruction jump instruction 
Floating 11 Address of next not Address of just executed - 
Point executed instruction jump instruction 
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Exception Priority EPCR EPCR EEAR 
(no delay slot) (delay slot) 
Address of instruction | Address of jump instruction - 












before the instruction that 
caused exception 


Trap 7 
that caused exception 





Table 6-3. Values of EPCR and EEAR After Exception 


If fast context switching is used, SR[CID] is incremented with each new exception 
so that a new set of shadowed registers is used. If SR[CID] will overflow with the current 
exception, a range exception is invoked. 


However, if SR[CE] is not set, fast context switching is not enabled. In this case all 
registers that will be modified by exception handler routine must first be saved. 


All exceptions set a new SR where both MMUs are disabled (address translation 
disabled), supervisor mode is turned on, and tick timer exceptions and interrupts are 
disabled. (SR[DME]=0, SRIIME]=0, SR[ISM]=1, SR[TEE]=0 and SR[TEE]=0). 

When enough machine state information has been saved by the exception handler, 
SR[TTE] and SR[IEE] can be re-enabled so that tick timer and external interrupts are not 
blocked. 


When returning from an exception handler with I.rfe, SR and PC are restored. If 
SR[CE] is set, CID will be automatically decremented and the previous machine state 
will be restored; otherwise, general-purpose registers previously saved by exception 
handler need to be restored as well. 


6.3.1 Particular delay slot issues 


Instructions placed in the delay slot will cause EPCR to be set to the address of the 
jump instruction, not the delay slot or target instruction. Because of this, two categories 
of instruction should never be placed in the delay slot: 

1. Instructions altering the conditions of the jump itself. This is why 1|.jr must not 
have a delay slot instruction modify the target address register. 

2. Instructions consistently causing an exception, such as |.sys. Normally l|.sys 
returns to continue execution, but if placed in a delay slot it instead causes a 
repeat of the system call itself. 


ltrap is generally used as a software breakpoint, so may not have the same 
concern. 


6.4 Fast Context Switching (Optional) 


Fast context switching is a technique that reduces register storing to stack when 
exceptions occur. Only one type of exception can be handled, so it is up to the software to 
figure out what caused it. Using software, both interrupt handler invokation and thread 
switching can be handled very quickly. The hardware should be capable of switching 
between contexts in only one cycle. 


Context can also be switched during an exception or by using a supervisor register 
CXR (context register) available only in supervisor mode. CXR is the same for all 
contexts. 
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6.4.1 Changing Context in Supervisor Mode 


The read/write register CXR consists of two parts: the lower 16 bits represents the 
current context register set. The upper 16 bits represent the current CID. CCID cannot be 
accessed in user mode. Writing to CCID causes an immediate context change. Reading 
from CCID returns the running (current) context ID. The context where CID=0 is also 
called the main context. 











| BIT | 31-16 | 15-0 
| Identifier | CCID | CCRS 
Reset 0 0 


CCRS has two functions: 
v When an exception occurs, it holds the previous CID. 
vY It is used to access other context's registers. 


6.4.2 Context Switch Caused by Exception 


When an exception occurs and fast context switching is enabled, the CCID is 
copied to CCRS and then set to zero, thus switching to main context. 


Functions of the main context are: 

¥ Switching between threads 

vY Handling exceptions 

v Preparing, loading, saving, and releasing context identifiers to/from the CID table 


CXR should be stored in a general-purpose register as soon as possible, to allow 
further exception nesting. 


The following table shows an example how the CID table could be used. 
Generally, there is no need that free exception contexts are equal. 














CID Function 
7 
6 Exception contexts 
5 
4 
3 
; Thread contexts 
1 
0 Main context 














Four thread contexts are loaded, and software can switch between them freely 
using main context, running in supervisor mode. When an exception occurs, first need to 
be determined what caused it and switch to the next free exception context. Since 
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exceptions can be nested, more free contexts may have to be available. Some of the 
contexts thus need to be stored to memory in order to switch to a new exception. 


The algorithm used in the main context to handle context saving/restoring and 
switching can be kept as simple as possible. It should have enough (of its own) registers 


to store information such as: 
¥ Current running CID 

v Next exception 

v¥ Thread cycling info 

Y Pointers to context table in memory 
¥ Copy of CXR 


If the number of interrupts is significant, some sort of defered interrupts calls 
mechanism can be used. The main context algorithm should store just I/O information 
passed by the interrupt for further execution and return from main context as soon as 
possible. 


6.4.3 Accessing Other Contexts’ Registers 


This operation can be done only in supervisor mode. In the basic instruction set we 
have the |.mtspr and |.mfspr instructions that are used to access shadowed registers. 
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7 Memory Model 


This chapter describes the OpenRISC 1000 weakly ordered memory model. 


7.1 Memory 


Memory is byte-addressed with halfword accesses aligned on 2-byte boundaries, 
singleword accesses aligned on 4-byte boundaries, and doubleword accesses aligned on 
8-byte boundaries. 


7.2 Memory Access Ordering 


The OpenRISC 1000 architecture specifies a weakly ordered memory model for 
uniprocessor and shared memory multiprocessor systems. This model has the advantage 
of a higher-performance memory system but places the responsibility for strict access 
ordering on the programmer. 


The order in which the processor performs memory access, the order in which 
those accesses complete in memory, and the order in which those accesses are viewed by 
another processor may all be different. Two means of enforcing memory access ordering 
are provided to allow programs in uniprocessor and multiprocessor system to share 
memory. 


An OpenRISC 1000 processor implementation may also implement a more 
restrictive, strongly ordered memory model. Programs written for the weakly ordered 
memory model will automatically work on processors with strongly ordered memory 
model. 


7.2.1 Memory Synchronize Instruction 


The I.msync instruction permits the program to control the order in which load and 
store operations are performed. This synchronization is accomplished by requiring 
programs to indicate explicitly in the instruction stream, by inserting a memory sync 
instruction, that synchronization is required. The memory sync instruction ensures that all 
memory accesses initiated by a program have been performed before the next instruction 
is executed. 

OpenRISC 1000 processor implementations, that implement the strongly-ordered 
memory model instead of the weakly-ordered one, can execute memory synchronization 
instruction as a no-operation instruction. 


7.2.2 Pages Designated as Weakly-Ordered-Memory 


When a memory page is designated as a Weakly-Ordered-Memory (WOM) page, 
instructions and data can be accessed out-of-order and with prefetching. When a page is 
designated as not WOM, instruction fetches and load/store operations are performed in- 
order without any prefetching. 
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OpenRISC 1000 scalar processor implementations, that implement strongly- 
ordered memory model instead of the weakly-ordered one and perform load and store 
operations in-order, are not required to implement the WOM bit in the MMU. 


7.3 Atomicity 


A memory access is atomic if it is always performed in its entirety with no visible 
fragmentation. Atomic memory accesses are specifically required to implement software 
semaphores and other shared structures in systems where two different processes on the 
same processor, or two different processors in a multiprocessor environment, access the 
same memory location with intent to modify it. 


The OpenRISC 1000 architecture provides two dedicated instructions that together 
perform an atomic read-modify-write operation. 
1.lwa rD, I(rA) 
1.swa I(rA), rB 
Instruction llwa loads single word from memory, creating a reservation for a 
subsequent conditional store operation. A special register, invisible to the programmer, is 
used to hold the address of the memory location, which is used in the atomic read- 
modify-write operation. 


The reservation for a subsequent I.swa is cancelled if another store overlapping 
the same memory location occurs, another master writes overlapping same memory 
location (snoop hit), another ].swa (to any memory location) is executed, another |.lwa is 
executed or a context switch (exception) occur. Keep in mind that the overlapping stores 
may be byte or half-word size. 


If a reservation is still valid when the corresponding I.swa is executed, ].swa stores 
general-purpose register rB into the memory and SR[F] is set. 


If the reservation was cancelled, ].swa does not perform the store to memory and SR[F] is 
cleared. 


In implementations that use a weakly-ordered memory model, |.swa and 1|.l!wa will 
serve as synchronization points, similar to |.msync. 
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8 Memory Management 


This chapter describes the virtual memory and access protection mechanisms for 
memory management within the OpenRISC 1000 architecture. 


Note that this chapter describes the address translation mechanism from the 
perspective of the programming model. As such, it describes the structure of the page 
tables, the MMU conditions that cause MMU related exceptions and the MMU registers. 
The hardware implementation details that are invisible to the OpenRISC 1000 
programming model, such as MMU organization and TLB size, are not contained in the 
architectural definition. 


8.1 MMU Features 


The OpenRISC 1000 memory management unit includes the following principal 
features: 
¥ Support for effective address (EA) of 32 bits and 64 bits 
¥ Support for implementation specific size of physical address spaces up to 35 address 
bits (32 GByte) 
v¥ Three different page sizes: 
> Level 0 pages (32 Gbyte; only with 64-bit EA) translated with D/I Area 
Translation Buffer (ATB) 
> Level 1 pages (16 MByte) translated with D/I Area Translation Buffer (ATB) 
> Level 2 pages (8 Kbyte) translated with D/I Translation Lookaside Buffer (TLB) 
v Address translation using one-, two- or three-level page tables 
Y Powerful page based access protection with support for demand-paged virtual 
memory 
¥ Support for simultaneous multi-threading (SMT) 


8.2 MMU Overview 


The primary functions of the MMU in an OpenRISC 1000 processor are to 
translate effective addresses to physical addresses for memory accesses. In addition, the 
MMU provides various levels of access protection on a page-by-page basis. Note that this 
chapter describes the conceptual model of the OpenRISC 1000 MMU and 
implementations may differ in the specific hardware used to implement this model. 


Two general types of accesses generated by OpenRISC 1000 processors require 
address translation — instruction accesses generated by the instruction fetch unit, and data 
accesses generated by the load and store unit. Generally, the address translation 
mechanism is defined in terms of page tables used by OpenRISC 1000 processors to 
locate the effective to physical address mapping for instruction and data accesses. 

The definition of page table data structures provides significant flexibility for the 
implementation of performance enhancement features in a wide range of processors. 
Therefore, the performance enhancements used to the page table information on-chip 
vary from implementation to implementation. 
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Translation lookaside buffers (TLBs) are commonly implemented in OpenRISC 
1000 processors to keep recently-used page address translations on-chip. Although their 
exact implementation is not specified, the general concepts that are pertinent to the 
system software are described. 
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Figure 8-1. Translation of Effective to Physical Address — Simplified block diagram for 32-bit 
processor implementations 


Large areas can be translated with optional facility called Area Translation Buffer 
(ATB). ATBs translate 16MB and 32GB pages. If xTLB and xATB have a match on the 
same virtual address, xTLB is used. 

The MMU, together with the exception processing mechanism, provides the 
necessary support for the operating system to implement a paged virtual memory 
environment and for enforcing protection of designated memory areas. 
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8.3 MMU Exceptions 


To complete any memory access, the effective address must be translated to a 
physical address. An MMU exception occurs if this translation fails. 

TLB miss exceptions can happen only on OpenRISC 1000 processor 
implementations that do TLB reload in software. 

The page fault exceptions that are caused by missing PTE in page table or page 
access protection can happen on any OpenRISC 1000 processor implementations. 

















EXCEPTION NAME | VECTOR OFFSET CAUSING CONDITIONS 
Data Page Fault 0x300 No matching PTE found in page tables or page 
protection violation for load/store operations. 
Instruction Page 0x400 No matching PTE found in page tables or page 
Fault protection violation for instruction fetch. 
DTLB Miss 0x900 No matching entry in DTLB. 
ITLB Miss OxA00 No matching entry in ITLB. 





Table 8-1. MMU Exceptions 


The vector offset addresses in table are subject to the presence and setting of the of 
the Exception Vector Base Address Register (EVBAR) may have configured the 
exceptions to be processed at a different offset, however the least-significant 12-bit offset 
address remain the same. 


The state saved by the processor for each of the exceptions in Table 9-2 contains 
information that identifies the address of the failing instruction. Refer to the chapter 
entitled “Exception Processing” on page 256 for a more detailed description of exception 
processing. 


8.4 MMU Special-Purpose Registers 


Table 8-2 summarizes the registers that the operating system uses to program the 
MMU. These registers are 32-bit special-purpose supervisor-level registers accessible 
with the |.mtspr/l.mfspr instructions in supervisor mode only. 

Table 8-2 does not show two configuration registers that are implemented if 
implementation implements configuration registers. DMMUCFGR and IMMUCFGR 
describe capability of DMMU and IMMU. 




















Grp # Reg # Reg Name USER SUPV Description 
MODE MODE 
1 0 DMMUCR - R/W Data MMU Control register 
1 1 DMMUPR - R/W Data MMU Protection Register 
1 2 DTLBEIR - W Data TLB Entry Invalidate 
register 
1 4-7 DATBMRO- - R/W Data ATB Match registers 
DATBMR3 
1 8-11 DATBTRO- - R/W Data ATB Translate registers 
DATBTR3 
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Grp # Reg # Reg Name USER SUPV Description 
MODE MODE 
1 512- DTLBWOMRO- - R/W Data TLB Match registers Way 
639 DTLBWOMR127 0 
1 640- DTLBWOTRO- - R/W Data TLB Translate registers 
767 DTLBWOTR127 Way 0 
1 768- DTLBW1MRO- - R/W Data TLB Match registers Way 
895 DTLBW1MR127 1 
1 896- DTLBW1TRO- - R/W Data TLB Translate registers 
1023 DTLBW1TR127 Way 1 
1 1024- DTLBW2MRO- - R/W Data TLB Match registers Way 
1151 DTLBW2MR127 2 
1 1152- DTLBW2TRO- - R/W Data TLB Translate registers 
1279 DTLBW2TR127 Way 2 
1 1280- DTLBW3MRO- - R/W Data TLB Match registers Way 
1407 DTLBW3MR127 3 
1 1408- DTLBW3TRO- - R/W Data TLB Translate registers 
1535 DTLBW3TR127 Way 3 
2 0 IMMUCR - R/W Instruction MMU Control 
register 
2 1 IMMUPR - R/W Instruction MMU Protection 
Register 
2 2 ITLBEIR - Ww Instruction TLB Entry 
Invalidate register 
2 4-7 IATBMRO- - R/W Instruction ATB Match 
IATBMR3 registers 
2 8-11 IATBTRO- - R/W Instruction ATB Translate 
IATBTR3 registers 
2 512- ITLBWOMRO- - R/W Instruction TLB Match 
639 ITLBWOMR127 registers Way 0 
2 640- ITLBWOTRO- - R/W Instruction TLB Translate 
767 ITLBWOTR127 registers Way 0 
2 768- ITLBW1MRO- - R/W Instruction TLB Match 
895 ITLBW1MR127 registers Way 1 
2 896- ITLBW1TRO- - R/W Instruction TLB Translate 
1023 ITLBW1TR127 registers Way 1 
2 1024- ITLBW2MRO- - R/W Instruction TLB Match 
1151 ITLBW2MR127 registers Way 2 
2 1152- ITLBW2TRO- - R/W Instruction TLB Translate 
1279 ITLBW2TR127 registers Way 2 
2 1280- ITLBW3MRO- - R/W Instruction TLB Match 
1407 ITLBW3MR127 registers Way 3 
2 1408- ITLBW3TRO- - R/W Instruction TLB Translate 
1535 ITLBW3TR127 registers Way 3 
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As TLBs are noncoherent caches of PTEs, software that changes the page tables in 
any way must perform the appropriate TLB invalidate operations to keep the on-chip 
TLBs coherent with respect to the page tables in memory. 


8.4.1 Data MMU Control Register (DMMUCR) 


The DMMUCR is a 32-bit special-purpose supervisor-level register accessible 
with the |.mtspr/l.mfspr instructions in supervisor mode. 


It provides general control of the DMMU. 











Bit 31-10 9-1 0 
Identifier PTBP Reserved DTF 
Reset 0 Xx 0 

R/W R/W R R/W 


























DTF DTLB Flush 
O DTLB ready for operation 
1 DTLB flush request/status 





PTBP Page Table Base Pointer 
N 22-bit pointer to the base of page directory/table 








Table 8-3. DMMUCR Field Descriptions 


The PTBP field in the DMMUCR is required only in implementations with 
hardware PTE reload support. Implementations that use software TLB reload are not 
required to implement this field because the page table base pointer is stored in a TLB 
miss exception handler’s variable. 


The DTF is optional and when implemented it flushes entire DTLB. 


8.4.2 Data MMU Protection Register (DMMUPR) 


The DMMUPR is a 32-bit special-purpose supervisor-level register accessible with 
the |.mtspr/l.mfspr instructions in supervisor mode. 


It defines 7 protection groups indexed by PPI fields in PTEs. 








25 | 24 
SWE7 | SRE7 


Bit 31-28 27 | 26 
Identifier Reserved UWE7 | URE7 
x 












Reset 






























































Identifier 
Reset 0 0 0 0 0 0 0 0 
R/W R/W R/W R/W R/W R/W R/W R/W R/W 
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Bit 15 14 13 12 11 10 9 8 
Identifier UWE4 URE4 SWE4 SRE4 UWE3 URE3 SWE3 SRE3 
Reset 0 0 0 0 0 0 0 

R/W R/W R/W R/W R/W R/W R/W R/W R/W 
Bit 7 5 4 3 2 1 0 
Identifier UWE2 URE2 SWE2 SRE2 UWE1 URE1 SWE1 SRE1 
Reset 0 0 0 0 0 0 0 
R/W R/W R/W R/W R/W R/W R/W R/W R/W 
SREx Supervisor Read Enable x 
0 Load operation in supervisor mode not permitted 
1 Load operation in supervisor mode permitted 
SWEx Supervisor Write Enable x 
0 Store operation in supervisor mode not permitted 
1 Store operation in supervisor mode permitted 
UREx User Read Enable x 
0 Load operation in user mode not permitted 
1 Load operation in user mode permitted 
UWEx User Write Enable x 
0 Store operation in user mode not permitted 
1 Store operation in user mode permitted 











Table 8-4. DMMUPR Field Descriptions 


A DMMUPR is required only in implementations with hardware PTE reload 
support. Implementations that use software TLB reload are not required to implement this 
register; instead a TLB miss handler should have a software variable as replacement for 
the DMMUPR and it should do a software look-up operation and set DTLBWyTRx 


protection bits accordingly. 


8.4.3 


Instruction MMU Control Register (IMMUCR) 


The IMMUCR is a 32-bit special-purpose supervisor-level register accessible with 


the |.mtspr/l.mfspr instructions in supervisor mode. 


It provides general control of the IMMU. 






































Bit 31-10 9-1 0 
Identifier PTBP Reserved ITF 
Reset 0 x 0 

R/W R/W R R/W 
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ITF ITLB Flush 
0 ITLB ready for operation 
1 ITLB flush request/status 





PTBP Page Table Base Pointer 
N 22-bit pointer to the base of page directory/table 





Table 8-5. IMMUCR Field Descriptions 


The PTBP field in xMMUCR is required only in implementations with hardware 
PTE reload support. Implementations that use software TLB reload are not required to 
implement this field because the page table base pointer is stored in a TLB miss 
exception handler’s variable. 


The ITF is optional and when implemented it flushes entire ITLB. 


8.4.4 Instruction MMU Protection Register (IMMUPR) 


The IMMUP register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


It defines 7 protection groups indexed by PPI fields in PTEs. 
















































































Bit 31-14 
Identifier Reserved 
Reset Xx 
R/W R 
Bit 7 6 
Identifier UXE4 SXE4 
Reset 0 0 
R/W R/W R/W 
SXEx Supervisor Execute Enable x 
O Instruction fetch in supervisor mode not permitted 
1 Instruction fetch in supervisor mode permitted 
UXEx User Execute Enable x 
O Instruction fetch in user mode not permitted 
1 Instruction fetch in user mode permitted 








Table 8-6. IMMUPR Field Descriptions 


The IMMUPR is required only in implementations with hardware PTE reload 
support. Implementations that use software TLB reload are not required to implement this 
register; instead the TLB miss handler should have a software variable as replacement for 
the IMMUPR register and it should do a software look-up operation and set [T[LBWyTRx 
protection bits accordingly. 
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8.4.5 Instruction/Data TLB Entry Invalidate Registers 
(xTLBEIR) 


The instruction/data TLB entry invalidate registers are special-purpose registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. They are 32 bits wide 
in 32-bit implementations and 64 bits wide in 64-bit implementation. 

The xTLBEIR is written with the effective address. The corresponding xTLB entry 
is invalidated in the local processor. 

















Bit 31-0 
Identifier EA 
Reset 0 
R/W Write Only 











EA Effective Address 
EA that targets TLB entry inside TLB 


Table 8-7. xTLBEIR Field Descriptions 


8.4.6 Instruction/Data Translation Lookaside Buffer 
Way y Match Registers 
(xTLBWyMRO-xTLBWyMR127) 


The xTLBWyMR registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


Together with the xTLBWyTR registers they cache translation entries used for 
translating virtual to physical address. A virtual address is formed from the EA generated 
during instruction fetch or load/store operation, and the SR[CID] field. xTLBWyMR 
registers hold a tag that is compared with the current virtual address generated by the 
CPU core. Together with the xTLBWyTR registers and match logic they form a core part 
of the xMMU. 






































Bit 31-13 
Identifier VPN 
Reset x 
R/W R/W 
Bit 12-8 7-6 5-2 1 0 
Identifier Reserved LRU CID PL1 
Reset x 0 X 0 0 
R/W R R/W R/W R/W R/W 
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Vv Valid 
O TLB entry invalid 
1 TLB entry valid 
PL1 Page Level 1 
O Page level is 2 
1 Page level is 1 





CID Context ID 
0-15 TLB entry translates for CID 
LRU Last Recently used 
0-3 Index in LRU queue (lower the number, more recent access) 
VPN Virtual Page Number 








O-N Number of the virtual frame that must match EA 


Table 8-8. xTLBMR Field Descriptions 


The CID bits can be hardwired to zero if the implementation does not support fast 
context switching and SR[CID] bits. 
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8.4.7 Data Translation Lookaside Buffer Way y 
Translate Registers 
(DTLBWyTRO-DTLBWyTR127) 


The DTLBWyTR registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

Together with the DILBWyMR registers they cache translation entries used for 
translating virtual to physical address. A virtual address is formed from the EA generated 
during a load/store operation, and the SR[CID] field. Together with the DTLBWyMR 
registers and match logic they form a core of the DMMU. 













































































Bit 31-13 12-10 9 8 7 
Identifier PPN Reserved SWE SRE UWE 
Reset x x x Xx X 

R/W R/W R R/W R/W R/W 
Bit 6 5 4 2 1 0 
Identifier URE D A WBC Cl CC 
Reset Xx x x x Xx x 
R/W R/W R/W R/W R/W R/W R/W 
CC Cache Coherency 


0 Data cache coherency is not enforced for this page 
1 Data cache coherency is enforced for this page 
Cache Inhibit 
0 Cache is enabled for this page 
1 Cache is disabled for this page 
Write-Back Cache 
0 Data cache uses write-through strategy for data from this page 
1 Data cache uses write-back strategy for data from this page 





Cl 





WBC 


WOM Weakly-Ordered Memory 
0 Strongly-ordered memory model for this page 


1 Weakly-ordered memory model for this page 





Accessed 

O Page was not accessed 

1 Page was accessed 

Dirty 

O Page was not modified 

1 Page was modified 
URE User Read Enable x 
0 Load operation in user mode not permitted 

1 Load operation in user mode permitted 
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UWE User Write Enable x 
O Store operation in user mode not permitted 


1 Store operation in user mode permitted 





SRE Supervisor Read Enable x 

0 Load operation in supervisor mode not permitted 
1 Load operation in supervisor mode permitted 

SWE Supervisor Write Enable x 

0 Store operation in supervisor mode not permitted 


1 Store operation in supervisor mode permitted 





PPN Physical Page Number 
O-N Number of the physical frame in memory 











Table 8-9. DTLBTR Field Descriptions 


8.4.8 Instruction Translation Lookaside Buffer Way y 
Translate Registers 
(ITLBWyTRO-ITLBWyTR127) 


The ITLBWyTR registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


Together with the ITLBWyMR registers they cache translation entries used for 
translating virtual to physical address. A virtual address is formed from the EA generated 
during an instruction fetch operation, and the SR[CID] field. Together with the 
ITLBWyMR registers and match logic they form a core part of the IMMU. 

































































Bit 31-13 1228 | 7 
Identifier PPN Reserved UXE 
Reset X Xx Xx 

R/W R/W R/W R/W 
Bit 6 5 4 3 2 1 0 
Identifier SXE D A WOM WBC Cl CC 
Reset Xx X Xx X Xx Xx X 
R/W R/W R/IW R/W R/IW R/W R/W R/W 
CC Cache Coherency 


O Data cache coherency is not enforced for this page 
1 Data cache coherency is enforced for this page 





Cl Cache Inhibit 


0 Cache is enabled for this page 
1 Cache is disabled for this page 
WBC Write-Back Cache 
0 Data cache uses write-through strategy for data from this page 
1 Data cache uses write-back strategy for data from this page 
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WOM Weakly-Ordered Memory 
O Strongly-ordered memory model for this page 
1 Weakly-ordered memory model for this page 





A Accessed 
O Page was not accessed 
1 Page was accessed 
D Dirty 
O Page was not modified 
1 Page was modified 
SXE Supervisor Execute Enable x 
O Instruction fetch operation in supervisor mode not permitted 
1 Instruction fetch operation in supervisor mode permitted 
UXE User Execute Enable x 
O Instruction fetch operation in user mode not permitted 
1 Instruction fetch operation in user mode permitted 
PPN Physical Page Number 
O-N Number of the physical frame in memory 











Table 8-10. ITLBWyTR Field Descriptions 


8.4.9 Instruction/Data Area Translation Buffer Match 
Registers (xATBMRO-xATBMR3) 


The xATBMR registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


Together with the xATBTR registers they cache translation entries used for 
translating virtual to physical address of large address space areas. A virtual address is 
formed from the EA generated during an instruction fetch or load/store operation, and the 
SR[CID] field. xATBMR registers hold a tag that is compared with the current virtual 
address generated by the CPU core. Together with the xATBTR registers and match logic 
they form a core part of the xMMU. 




































































Bit 31-10 
Identifier VPN 
Reset Xx 
R/W R/W 
Bit 9-5 4-1 0 
Identifier Reserved CID Vv 
Reset X 0 0 
R/W R | RW | RW RIW 
V Valid 
O TLB entry invalid 
1 TLB entry valid 
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CID Context ID 
0-15 TLB entry translates for CID 
PS Page Size 


0 16 Mbyte page 
1 32 Gbyte page 
Virtual Page Number 
O-N Number of the virtual frame that must match EA 





VPN 











Table 8-11. xATBMR Field Descriptions 


The CID bits can be hardwired to zero if the implementation does not support fast 
context switching and SR[CID] bits. 


Data Area Translation Buffer Translate 
Registers (DATBTRO-DATBTR3) 


The DATBTR registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

Together with the DATBMR registers they cache translation entries used for 
translating virtual to physical address. A virtual address is formed from the EA generated 
during a load/store operation, and the SR[CID] field. Together with the DATBMR 
registers and match logic they form a core part of the DMMU. 


8.4.10 














Bit 31-10 9 8 7 
Identifier PPN UWE URE SWE 
Reset xX x x Xx 

R/W R/W R/W R/W R/W 





























































Identifier 
Reset X Xx X Xx X x 
R/W R/W R/W R/W R/W R/W R/W 
CC Cache Coherency 


0 Data cache coherency is not enforced for this page 
1 Data cache coherency is enforced for this page 





Cl Cache Inhibit 
0 Cache is enabled for this page 
1 Cache is disabled for this page 
WBC Write-Back Cache 


0 Data cache uses write-through strategy for data from this page 
1 Data cache uses write-back strategy for data from this page 
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WOM Weakly-Ordered Memory 
0 Strongly-ordered memory model for this page 
1 Weakly-ordered memory model for this page 





A Accessed 
O Page was not accessed 
1 Page was accessed 
D Dirty 
O Page was not modified 
1 Page was modified 
SRE Supervisor Read Enable x 
0 Load operation in supervisor mode not permitted 
1 Load operation in supervisor mode permitted 
SWE Supervisor Write Enable x 
O Store operation in supervisor mode not permitted 
1 Store operation in supervisor mode permitted 
URE User Read Enable x 
0 Load operation in user mode not permitted 
1 Load operation in user mode permitted 
UWE User Write Enable x 
O Store operation in user mode not permitted 
1 Store operation in user mode permitted 








PPN Physical Page Number 
O-N Number of the physical frame in memory 











Table 8-12. DATBTR Field Descriptions 


8.4.11 Instruction Area Translation Buffer Translate 
Registers (IATBTRO-IATBTR3) 


The IATBTR registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

Together with the IATBMR registers they cache translation entries used for 
translating virtual to physical address. A virtual address is formed from the EA generated 
during an instruction fetch operation, and the SR[CID] field. Together with the IATBMR 
registers and match logic they form a core part of the IMMU. 





































Bit 31-10 9-8 7 
Identifier PPN Reserved UXE 
Reset xX x X 

R/W R/W R/W R/W 
Bit 6 5 4 2 1 0 
Identifier SXE D A WBC Cl CC 
Reset 
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CC Cache Coherency 
0 Data cache coherency is not enforced for this page 
1 Data cache coherency is enforced for this page 
Cl Cache Inhibit 
0 Cache is enabled for this page 
1 Cache is disabled for this page 
WBC Write-Back Cache 
0 Data cache uses write-through strategy for data from this page 
1 Data cache uses write-back strategy for data from this page 
WOM Weakly-Ordered Memory 
0 Strongly-ordered memory model for this page 
1 Weakly-ordered memory model for this page 





A Accessed 
O Page was not accessed 
1 Page was accessed 
D Dirty 
O Page was not modified 
1 Page was modified 





SXE Supervisor Execute Enable x 
O Instruction fetch operation in supervisor mode not permitted 
1 Instruction fetch operation in supervisor mode permitted 
UXE User Execute Enable x 
O Instruction fetch operation in user mode not permitted 
1 Instruction fetch operation in user mode permitted 





PPN Physical Page Number 
O-N Number of the physical frame in memory 











Table 8-13. IATBTR Field Descriptions 


8.5 Address Translation Mechanism in 32-bit 


Implementations 


Memory in an OpenRISC 1000 implementation with 32-bit effective addresses 
(EA) is divided into level 1 and level 2 pages. Translation is therefore based on two-level 
page table. However for virtual memory areas that do not need the smallest 8KB page 
granularity, only one level can be used. 
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Virtual Address 
Space 
2°36 bytes 





Truncated 
Effective Address 
Space per Process 
2/32 
bytes 


Level 1 Page 





2°24 bytes 








Effective Address 
Space per Process 





Level 1 Page 


Level 2 Page 
2"3 bytes 





Level 2 Page 





























Figure 8-2. Memory Divided Into L1 and L2 pages 


The first step in page address translation is to append the current SR[CID] bits as 
most significant bits to the 32-bit effective address, combining them into a 36-bit virtual 
address. This virtual address is then used to locate the correct page table entry (PTE) in 
the page tables in the memory. The physical page number is then extracted from the PTE 
and used in the physical address. Note that for increased performance, most processors 
implement on-chip translation lookaside buffers (TLBs) to cache copies of the recently- 
used PTEs. 
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35 31 24 23 13 (12 0 


Context ID Page Index Level 1 Page Index Level 2 Page Offset 
(4 bits) (8 bits) (11 bits) (13 bits) 









Virtual Page Number (VPN) 











L1 Page Directory 





0 
L2 Page Table 








Page Table 


Base Address 
depending on 
current CID 
































34 Y 0 
Physical Page Number Page Offset 
(22 bits) (13 bits) 


Figure 8-3. Address Translation Mechanism using Two-Level Page Table 


Figure 8-3 shows an overview of the two-level page table translation of a virtual 


address to a physical address: 


v 


v 


Bits 35..32 of the virtual address select the page tables for the current context 
(process) 


Bits 31..24 of the virtual address correspond to the level 1 page number within the 
current context’s virtual space. The L1 page index is used to index the L1 page 
directory and to retrieve the PTE from it, or together with the L2 page index to match 
for the PTE in on-chip TLBs. 

Bits 23..13 of the virtual address correspond to the level 2 page number within the 
current context’s virtual space. The L2 page index is used to index the L2 page table 
and to retrieve the PTE from it, or together with the L1 page index to match for the 
PTE in on-chip TLBs. 

Bits 12..0 of the virtual address are the byte offset within the page; these are 
concatenated with the PPN field of the PTE to form the physical address used to 
access memory 


The OpenRISC 1000 two-level page table translation also allows implementation 


of segments with only one level of translation. This greatly reduces memory requirements 
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for the page tables since large areas of unused virtual address space can be covered only 
by level 1 PTEs. 


Context ID Page Index Level 1 Page Offset 


(4 bits) (8 bits) (24 bits) 





| Virtual Page Number (VPN) | 


(+) L1 Page Table 

















Page Table PTE1 
Base Address — 
depending on 

current CID 











255 








Truncated Physical Page Number Page Offset 
(11 bits) (24 bits) 








Figure 8-4. Address Translation Mechanism using only L1 Page Table 


Figure 8-4 shows an overview of the one-level page table translation of a virtual 
address to physical address: 


Y Bits 35..32 of the virtual address select the page tables for the current context 
(process) 


¥Y Bits 31..24 of the virtual address correspond to the level 1 page number within the 
current context’s virtual space. The L1 page index is used to index the L1 page table 
and to retrieve the PTE from it, or to match for the PTE in on-chip TLBs. 


Y Bits 23..0 of the virtual address are the byte offset within the page; these are 
concatenated with the truncated PPN field of the PTE to form the physical address 
used to access memory 
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8.6 Address Translation Mechanism in 64-bit 


Implementations 


Memory in OpenRISC 1000 implementations with 64-bit effective addresses (EA) 
is divided into level 0, level 1 and level 2 pages. Translation is therefore based on three- 
level page table. However for virtual memory areas that do not need the smallest page 
granularity of 8KB, two level translation can be used. 


Virtual Address 








Space 
2°50 bytes 
Truncated 
Effective 
Address Space 
| per Process / 
; 26 bytes i 
} yt / | Level 0 Page 
i ma 2°35 bytes oa 
! ra ra Level 1 Page 
} / ra 2"*4 bytes 
Effective 
Pers Level 0 Page Level 1 Page Level 2 Page 
ress Space Level 2 Page 2" bytes 
per Process q 























Figure 8-5. Memory Divided Into LO, L1 and L2 pages 


The first step in page address translation is truncation of the 64-bit effective 
address into a 46-bit address. Then the current SR[CID] bits are appended as most 
significant bits. The 50-bit virtual address thus formed is then used to locate the correct 
page table entry (PTE) in the page tables in the memory. The physical page number is 
then extracted from the PTE and used in the physical address. Note that for increased 
performance, most processors implement on-chip translation lookaside buffers (TLBs) to 
cache copies of the recently-used PTEs. 
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Figure 8-6. Address Translation Mechanism using Three-Level Page Table 


Figure 8-6 shows an overview of the three-level page table translation of a virtual 


address to physical address: 


v 


v 


Bits 49..46 of the virtual address select the page tables for the current context 
(process) 


Bits 45..35 of the virtual address correspond to the level 0 page number within current 
context’s virtual space. The LO page index is used to index the LO page directory and 
to retrieve the PTE from it, or together with the L1 and L2 page indexes to match for 
the PTE in on-chip TLBs. 


Bits 34..24 of the virtual address correspond to the level 1 page number within the 
current context’s virtual space. The L1 page index is used to index the L1 page 
directory and to retrieve the PTE from it, or together with the LO and L2 page indexes 
to match for the PTE in on-chip TLBs. 


Bits 23..13 of the virtual address correspond to the level 2 page number within the 
current context’s virtual space. The L2 page index is used to index the L2 page table 
and to retrieve the PTE from it, or together with the LO and L1 page indexes to match 
for the PTE in on-chip TLBs. 


Bits 12..0 of the virtual address are the byte offset within the page; these are 
concatenated with the truncated PPN field of the PTE to form the physical address 
used to access memory 
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The OpenRISC 1000 three-level page table translation also allows implementation 
of large segments with two levels of translation. This greatly reduces memory 
requirements for the page tables since large areas of unused virtual address space can be 
covered only by level 1 PTEs. 
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Figure 8-7. Address Translation Mechanism using Two-Level Page Table 


Figure 8-7 shows an overview of the two-level page table translation of a virtual 
address to physical address: 


v Bits 49..46 of the virtual address select the page tables for the current context 
(process) 


¥ Bits 45..35 of the virtual address correspond to the level 0 page number within the 
current context’s virtual space. The LO page index is used to index the LO page 
directory and to retrieve the PTE from it, or together with the L1 page index to match 
for the PTE in on-chip TLBs. 


¥ Bits 34..24 of the virtual address correspond to the level 1 page number within the 
current context’s virtual space. The L1 page index is used to index the L1 page table 
and to retrieve the PTE from it, or together with the LO page index to match for the 
PTE in on-chip TLBs. 
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Y Bits 23..0 of the virtual address are the byte offset within the page; these are 
concatenated with the truncated PPN field of the PTE to form the physical address 
used to access memory 


8.7 Memory Protection Mechanism 


After a virtual address is determined to be within a page covered by the valid PTE, 
the access is validated by the memory protection mechanism. If this protection 
mechanism prohibits the access, a page fault exception is generated. 

The memory protection mechanism allows selectively granting read access, write 
access or execute access for both supervisor and user modes. The page protection 
mechanism provides protection at all page level granularities. 




















Protection attribute Meaning 

DMMUPR[SREx] Enable load operations in supervisor mode to the page. 
DMMUPR[SWEx] Enable store operations in supervisor mode to the page. 
IMMUPR[SXEx] Enable execution in supervisor mode of the page. 
DMMUPR[UREx] Enable load operations in user mode to the page. 
DMMUPR[UWEx] Enable store operations in user mode to the page. 
IMMUPR[UXEx] Enable execution in user mode of the page. 


Table 8-14. Protection Attributes 


Table 8-14 lists page protection attributes defined in MMU protection registers. 
For the individual page the appropriate strategy out of seven possible strategies 
programmed in MMU protection registers is selected with the PPI field of the PTE. 

In OpenRISC 1000 processors that do not implement TLB/ATB reload in 
hardware, protection registers are not needed. 





DMMUPR | }@-—— bP» 
Protection groups SRE 




















PPI 


Figure 8-8. Selection of Page Protection Attributes for Data Accesses 
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Figure 8-9. Selection of Page Protection Attributes for Instruction Fetch Accesses 


8.8 Page Table Entry Definition 


Page table entries (PTEs) are generated and placed in page tables in memory by 
the operating system. A PTE is 32 bits wide and is the same for 32-bit and 64-bit 
OpenRISC 1000 processor implementations. 


A PTE translates a virtual memory area into a physical memory area. How much 
virtual memory is translated depends on which level the PTE resides. PTEs are either in 
page directories with L bit zeroed or in page tables with L bit set. PTEs in page 
directories point to next level page directory or to final page table that containts PTEs for 
actual address translation. 


31 10 9 8 6 5 4 3 2 Hl 0 





Physical Page Number PP Index 
(22 bits) : (3 bits) D | A |wom|wec} cl | cc 



































Figure 8-10. Page Table Entry Format 


CC Cache Coherency 
0 Data cache coherency is not enforced for this page 


1 Data cache coherency is enforced for this page 
Cache Inhibit 
0 Cache is enabled for this page 
1 Cache is disabled for this page 
Write-Back Cache 
0 Data cache uses write-through strategy for data from this page 
1 Data cache uses write-back strategy for data from this page 


Cl 





WBC 
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WOM Weakly-Ordered Memory 
0 Strongly-ordered memory model for this page 
1 Weakly-ordered memory model for this page 





A Accessed 
O Page was not accessed 
1 Page was accessed 
D Dirty 
O Page was not modified 
1 Page was modified 


PPI Page Protection Index 
0 PTE is invalid 
1-7 Selects a group of six bits from a set of seven protection attribute groups in 
XMMUCR 
L Last 


0 PTE from page directory pointing to next page directory/table 
1 Last PTE in a linked form of PTEs (describing the actual page) 
PPN Physical Page Number 
O-N Number of the physical frame in memory 














Table 8-15. PTE Field Descriptions 


8.9 Page Table Search Operation 


An implementation may choose to implement the page table search operation in 
either hardware or software. For all page table search operations data addresses are 
untranslated (i.e. the effective and physical base address of the page table are the same). 


When implemented in software, two TLB miss exceptions are used to handle TLB 
reload operations. Also, the software is responsible for maintaining accessed and dirty 
bits in the page tables. 


8.10Page History Recording 


The accessed (A) and dirty (D) bits reside in each PTE and keep information about 
the history of the page. The operating system uses this information to determine which 
areas of the main memory to swap to the disk and which areas of the memory to load 
back to the main memory (demand-paging). 

The accessed (A) bit resides both in the PTE in page table and in the copy of PTE 
in the TLB. Each time the page is accessed by a load, store or instruction fetch operation, 
the accessed bit is set. 


If the TLB reload is performed in software, then the software must also write back 
the accessed bit from the TLB to the page table. 

In cases when access operation to the page fails, it is not defined whether the 
accessed bit should be set or not. Since the accessed bit is merely a hint to the operating 
system, it is up to the implementation to decide. 

It is up to the operating system to determine when to explicitly clear the accessed 
bit for a given page. 
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The dirty (D) bit resides in both the PTE in page table and in the copy of PTE in 
the TLB. Each time the page is modified by a store operation, the dirty bit is set. 


If TLB reload is performed in software, then the software must also write back the 
dirty bit from the TLB to the page table. 


In cases when access operation to the page fails, it is not defined whether the dirty 
bit should be set or not. Since the dirty bit is merely a hint to the operating system, it is up 
to the implementation to decide. However implementation or TLB reload software must 
check whether page is actually writable before setting the dirty bit. 


It is up to the operating system to determine when to explicitly clear the dirty bit 
for a given page. 


8.11Page Table Updates 


Updates to the page tables include operations like adding a PTE, deleting a PTE 
and modifying a PTE. On multiprocessor systems exclusive access to the page table must 
be assured before it is modified. 

TLBs are noncoherent caches of the page tables and must be maintained 
accordingly. Explicit software synchronization between TLB and page tables is required 
so that page tables and TLBs remain coherent. 

Since the processor reloads PTEs even during updates of the page table, special 
care must be taken when updating page tables so that the processor does not accidently 
use half modified page table entries. 
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9 Cache Model & 
Cache Coherency 


This chapter describes the OpenRISC 1000 cache model and architectural control 
to maintain cache coherency in multiprocessor environment. 


Note that this chapter describes the cache model and cache coherency mechanism 
from the perspective of the programming model. As such, it describes the cache 
management principles, the cache coherency mechanisms and the cache control registers. 
The hardware implementation details that are invisible to the OpenRISC 1000 
programming model, such as cache organization and size, are not contained in the 
architectural definition. 


The function of the cache management registers depends on the implementation of 
the cache(s) and the setting of the memory/cache access attributes. For a program to 
execute properly on all OpenRISC 1000 processor implementations, software should 
assume a Harvard cache model. In cases where a processor is implemented without a 
cache, the architecture guarantees that writing to cache registers will not halt execution. 
For example a processor without cache should simply ignore writes to cache management 
registers. A processor with a Stanford cache model should simply ignore writes to 
instruction cache management registers. In this manner, programs written for separate 
instruction and data caches will run on all compliant implementations. 


9.1 Cache Special-Purpose Registers 


Table 9-1 summarizes the registers that the operating system uses to manage the 
cache(s). 

For implementations that have unified cache, registers that control the data and 
instruction caches are merged and available at the same time both as data and intruction 
cache registers. 




















GRP# |} REG#) REG NAME USER SUPV DESCRIPTION 
MODE | MODE 

3 0 DCCR - R/W Data Cache Control Register 

3 1 DCBPR W Ww Data Cache Block Prefetch Register 

3 2 DCBFR WwW Ww Data Cache Block Flush Register 

3 3 DCBIR - Ww Data Cache Block Invalidate Register 

3 4 DCBWR WwW WwW Data Cache Block Write-back 
Register 

3 5 DCBLR - Ww Data Cache Block Lock Register 

4 0 ICCR - R/W Instruction Cache Control Register 

4 1 ICBPR WwW WwW Instruction Cache Block PreFetch 
Register 

4 2 ICBIR WwW Ww Instruction Cache Block Invalidate 
Register 
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GRP #| REG#| REG NAME USER SUPV DESCRIPTION 
MODE MODE 
4 3 ICBLR - W Instruction Cache Block Lock 
Register 


Table 9-1. Cache Registers 


9.1.1 Data Cache Control Register 


The data cache control register is a 32-bit special-purpose register accessible with 
the |.mtspr/l.mfspr instructions in supervisor mode. 


The DCCR controls the operation of the data cache. 














Bit 31-8 7-0 
Identifier Reserved EW 
Reset x | 0 

R/W R | RW 














EW Enable Ways 
0000 0000 All ways disabled/locked 


1111 1111 All ways enabled/unlocked 











Table 9-2. DCCR Field Descriptions 


If data cache does not implement way locking, the DCCR is not required to be 
implemented. 


9.1.2 Instruction Cache Control Register 


The instruction cache control register is a 32-bit special-purpose register accessible 
with the |.mtspr/I.mfspr instructions in supervisor mode. 


The ICCR controls the operation of the instruction cache. 














Bit 31-8 7-0 
Identifier Reserved EW 
Reset x 0 

R/W R R/W 

















EW Enable Ways 
0000 0000 All ways disabled/locked 











1111 1111 All ways enabled/unlocked 





Table 9-3. ICCR Field Descriptions 


www.openrisc.io 1.2-1 288 of 357 








OpenCores OpenRISC 1000 Architecture Manual August 20, 2017 


If the instruction cache does not implement way locking, the ICCR is not required 
to be implemented. 


9.2 Cache Management 
This section describes special-purpose cache management registers for both data 
and instruction caches. 


Memory accesses caused by cache management are not recorded (unlike load or 
store instructions) and cannot invoke any exception. 


Instruction caches do not need to be coherent with the memory or caches of other 
processors. Software must make the instruction cache coherent with modified instructions 
in the memory. A typical way to accomplish this is: 


ils Data cache block write-back (update of the memory) 

l.csync (wait for update to finish) 

Instruction cache block invalidate (clear instruction cache block) 
Flush pipeline 


9.2.1 Data Cache Block Prefetch (Optional) 


The data cache block prefetch register is an optional special-purpose register 
accessible with the |.mtspr/l.mfspr instructions in both user and supervisor modes. It is 32 
bits wide in 32-bit implementations and 64 bits wide in 64-bit implementations. An 
implementation may choose not to implement this register and ignore all writes to this 
register. 


The DCBPR is written with the effective address and the corresponding block 
from memory is prefetched into the cache. Memory accesses are not recorded (unlike 
load or store instructions) and cannot invoke any exception. 


A data cache block prefetch is used strictly for improving performance. 


PWN 























Bit 31-0 
Identifier EA 
Reset 0 
R/W Write Only 
EA Effective Address 


EA that targets byte inside cache block 











Table 9-4. DCBPR Field Descriptions 


9.2.2 Data Cache Block Flush 


The data cache block flush register is a special-purpose register accessible with the 
Lmtspr/l.mfspr instructions in both user and supervisor modes. It is 32 bits wide in 32-bit 
implementations and 64 bits wide in 64-bit implementations. 

The DCBER is written with the effective address. If coherency is required then the 
corresponding: 
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v¥ Unmodified data cache block is invalidated in all processors. 


¥ Modified data cache block is written back to the memory and invalidated in all 
processors. 


v¥ Missing data cache block in the local processor causes that modified data cache block 
in other processor is written back to the memory and invalidated. If other processors 
have unmodified data cache block, it is just invalidated in all processors. 


If coherency is not required then the corresponding: 
¥ Unmodified data cache block in the local processor is invalidated. 


¥ Modified data cache block is written back to the memory and invalidated in local 
processor. 


v¥ Missing cache block in the local processor does not cause any action. 














Bit 31-0 
Identifier EA 
Reset 0 
R/W Write only 














EA Effective Address 
EA that targets byte inside cache block 


Table 9-5. DCBFR Field Descriptions 


9.2.3 Data Cache Block Invalidate 


The data cache block invalidate register is a special-purpose register accessible 
with the |mtspr/l.mfspr instructions in supervisor mode. It is 32 bits wide in 32-bit 
implementations and 64 bits wide in 64-bit implementations. 


The DCBIR is written with the effective address. If coherency is required then the 
corresponding: 


¥ Unmodified data cache block is invalidated in all processors. 
¥ Modified data cache block is invalidated in all processors. 


v¥ Missing data cache block in the local processor causes that data cache blocks in other 
processors are invalidated. 


If coherency is not required then corresponding: 
v¥ Unmodified data cache block in the local processor is invalidated. 
Modified data cache block in the local processor is invalidated. 
v¥ Missing cache block in the local processor does not cause any action. 


IN 
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Bit 31-0 
Identifier EA 
Reset 0 
Write Only 

















EA Effective Address 
EA that targets byte inside cache block 














Table 9-6. DCBIR Field Descriptions 


9.2.4 Data Cache Block Write-Back 


The data cache block write-back register is a special-purpose register accessible 
with the |.mtspr/l.mfspr instructions in both user and supervisor modes. It is 32 bits wide 
in 32-bit implementations and 64 bits wide in 64-bit implementations. 

The DCBWR is written with the effective address. If coherency is required then 
the corresponding data cache block in any of the processors is written back to memory if 
it was modified. If coherency is not required then the corresponding data cache block in 
the local processor is written back to memory if it was modified. 




















Identifier EA 
Reset 0 
R/W Write Only 














EA Effective Address 
EA that targets byte inside cache block 


Table 9-7. DCBWR Field Descriptions 


9.2.5 Data Cache Block Lock (Optional) 


The data cache block lock register is an optional special-purpose register 
accessible with the |.mtspr/l.mfspr instructions in both user and supervisor modes. It is 32 
bits wide in a 32-bit implementation and 64 bits wide in a 64-bit implementation. 

The DCBLR is written with the effective address. The corresponding data cache 
block in the local processor is locked. 

If all blocks of the same set in all cache ways are locked, then the cache refill may 
automatically unlock the least-recently used block. 




















Bit 31-0 
Identifier EA 
Reset 0 
Write Only 
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EA Effective Address 
EA that targets byte inside cache block 


Table 9-8. DCBLR Field Descriptions 


9.2.6 Instruction Cache Block Prefetch (Optional) 


The instruction cache block prefetch register is an optional special-purpose register 
accessible with the |.mtspr/l.mfspr instructions in both user and supervisor modes. It is 32 
bits wide in 32-bit implementations and 64 bits wide in 64-bit implementations. An 
implementation may choose not to implement this register and ignore all writes to this 
register. 

The ICBPR is written with the effective address and the corresponding block from 
memory is prefetched into the instruction cache. 


Instruction cache block prefetch is used strictly for improving performance. 



































Bit 31-0 
Identifier EA 
Reset 0 
R/W Write Only 
EA Effective Address 
EA that targets byte inside cache block 








Table 9-9. ICBPR Field Descriptions 


9.2.7 Instruction Cache Block Invalidate 


The instruction cache block invalidate register is a special-purpose register 
accessible with the |.mtspr/l.mfspr instructions in both user and supervisor modes. It is 32 
bits wide in 32-bit implementations and 64 bits wide in 64-bit implementations. 


The ICBIR is written with the effective address. If coherency is required then the 
corresponding instruction cache blocks in all processors are invalidated. If coherency is 
not required then the corresponding instruction cache block is invalidated in the local 
processor. 




















Bit 31-0 
Identifier EA 
Reset 0 
R/W Write Only 














EA Effective Address 
EA that targets byte inside cache block 


Table 9-10. ICBIR Field Descriptions 
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9.2.8 Instruction Cache Block Lock (Optional) 


The instruction cache block lock register is an optional special-purpose register 
accessible with the |.mtspr/l.mfspr instructions in both user and supervisor modes. It is 32 
bits wide in 32-bit implementations and 64 bits wide in 64-bit implementations. 

The ICBLR is written with the effective address. The corresponding instruction 
cache block in the local processor is locked. 

If all blocks of the same set in all cache ways are locked, then the cache refill may 
automatically unlock the least-recently used block. 


Missing cache block in the local processor does not cause any action. 


























Identifier EA 
Reset 0 
R/W Write Only 











EA Effective Address 
EA that targets byte inside cache block 


Table 9-11. ICBLR Field Descriptions 


9.3 Cache/Memory Coherency 

The primary role of the cache coherency system is to synchronize cache content 
with other caches and with the memory and to provide the same image of the memory to 
all devices using the memory. 

The architecture provides several features to implement cache coherency. In 
systems that do not provide cache coherency with the PTE attributes (because they do not 
implement a memory management unit), it may be provided through explicit cache 
management. 

Cache coherency in systems with virtual memory can be provided on a page-by- 
page basis with PTE attributes. The attributes are: 
¥ Cache Coherent (CC Attribute) 
¥ Caching-Inhibited (CI Attribute) 
v¥ Write-Back Cache (WBC Attribute) 


When the memory/cache attributes are changed, it is imperative that the cache 
contents should reflect the new attribute settings. This usually means that cache blocks 
must be flushed or invalidated. 
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9.3.1 Pages Designated as Cache Coherent Pages 


This attribute improves performance of the systems where cache coherency is 
performed with hardware and is relatively slow. Memory pages that do not need cache 
coherency are marked with CC=0 and only memory pages that need cache coherency are 
marked with CC=1. When an access to shared resource is made, the local processor will 
assert some kind of cache coherency signal and other processors will respond if they have 
a copy of the target location in their caches. 


To improve performance of uniprocessor systems, memory pages should not be 
designated as CC=1. 


9.3.2 Pages Designated as Caching-Inhibited Pages 


Memory accesses to memory pages designated with CI=1 are always performed 
directly into the main memory, bypassing all caches. Memory pages designated with 
CI=1 are not loaded into the cache and the target content should never be available in the 
cache. To prevent any accident copy of the target location in the cache, whenever the 
operating system sets a memory page to be caching-inhibited, it should flush the 
corresponding cache blocks. 


Multiple accesses may be merged into combined accesses except when individual 
accesses are separated by I.msync or I.csync or I.psync. 


9.3.3 Pages Designated as Write-Back Cache Pages 


Store accesses to memory pages designated with WBC=0 are performed both in 
data cache and memory. If a system uses multilevel hierarchy caches, a store must be 
performed to at least the depth in the memory hierarchy seen by other processors and 
devices. 


Multiple stores may be merged into combined stores except when individual stores 
are separated by I.msync or l.sync or l.psync. A store operation may cause any part of the 
cache block to be written back to main memory. 

Store accesses to memory pages designated with WBC=1 are performed only to 
the local data cache. Data from the local data cache can be copied to other caches and to 
main memory when copy-back operation is required. WBC=1 improves system 
performance, however it requires cache snooping hardware support in data cache 
controllers to guarantee cache coherency. 
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10 Multicore Support 


This chapter describes the OpenRISC 1000 support for multicore system 
configurations. This section is targeted at hardware integrators and operating system 
designers. 


10.1Introduction 


Multicore support is made possible by architecture facilities which include: 


> Atomic memory operations as described in the Atomicity section 

> Cache Coherency between multiple cores as described in the Cache/Memory 
Coherency section 

> Core Identification registers to identify which processor is running 


For a CPU architecture these features should be enough, but for a system design 
there are some additional considerations that need to be made. This chapter introduces 
some suggestions for OpenRISC multicore architectures to handle: 


> Inter processor communication 
> Multicore bootstrapping 
> Timer Synchronization 


10.2Inter Processor Communication 


In a multicore configuration each processor needs a way to communicate with 
other processors. This is needed for message sending and interrupt balancing. In 
OpenRISC each core has a full interrupt controller with 32 interrupt lines as described in 
Programmable Interrupt Controller (Optional). In multicore configurations the internal 
PIC is leveraged by routing all interrupts to all cores. The Open Multi-Processor 
Interrupt Controller (OMPIC) is a memory mapped programmable interrupt source 
providing a mechanism for Inter Processor Interrupts (IPI) enabling message sending and 
interrupt balancing. 


The OMPIC supports up to 8192 cores via 13 bit DST_CORE addressing field. 
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Figure 10-1: Multicore Interconnect with OMPIC 


The above Figure 10-1 shows a multicore system connected with OMPIC. The 
example uart device is connected to each core. The wart interrupt would typically be 
masked in all but one core. 

Each core communicates with other cores by writing requests to it’s own 
designated control register specifying the destination core in DST_CORE. When the 
control register is written to with the IRQ_GEN field asserted the associated IRQ line 
will be raised to signal the destination core of a pending message. In the destination 
core’s interrupt handler it shall write to its own control register with IRQ_ACK asserted 
to clear the interrupt. It will then read it’s own status register to receive the data 
message. It is important to ack the IRQ then read the status register in this order to 
ensure messages are not lost. 


10.2.1 |OMPIC Control Registers 


The OMPIC control registers are registers which are written to to send messages 
to another core or to Acknowledge interrupts. Cores will typically only write to their own 
control register. It is not typically useful to read from the control register 


The status registers are located at memory mapped addresses $OMPIC_BASE + 
(Core ID) * 8. For example: 


* $OMPIC_BASE + 0x0 
* $OMPIC_BASE + 0x8 
¢ $OMPIC_BASE + 0x10 
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Bit 31 30 29-16 15-0 
Identifier IRQ_ACK IRQ_GEN DST_CORE DATA 
Reset 0 0 0 0 
R/W WwW W W WwW 
IRQ_ACK IRQ Acknowledge 
If asserted, clears the IRQ of the core associated with this control register. 
IRQ_GEN IRQ Generate 
If asserted, raises the IRQ of the core designated by DST_CORE. 
DST_CORE Destination Core 
The core to perform the operation on. 
DATA Data 
The data to send to the destination core. 
Table 10-1. OMPIC Control Field Descriptions 
10.2.2 OMPIC Status Registers 


The OMPIC status registers are read only registers which are updated upon writes 
to control registers. 


The status registers are located at memory mapped addresses $OMPIC_BASE + 
( (Core ID) * 8 ) + 4. For example: 
¢ $OMPIC_BASE + 0x4 
¢ $OMPIC_BASE + Oxc 
* $OMPIC_BASE + 0x14 


















































Bit 31 30 29-16 15-0 
Identifier Reserved IRQ_PEND SRC_CORE DATA 
Reset Xx 0 0 0 
R/W - R R R 
IRQ_PEND IRQ Pending 
Signals that the IRQ for this core is pending to be serviced. 
SRC Source Core 
The core that sent the last message to this core. 
DATA Data 
The pending data to be received for this core. 
Table 10-2. OMPIC Status Field Descriptions 
10.3Temporary Storage 


During exception handling it is often required to temporarily store register values 
before the exception stack frame is initialized or even in the case that no stack is required. 
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It is recommended to use Shadow Registers for this temporary storage mechanism. For 
example: 

#define SPR_GPR_BASE (QO + 1024) 

#define SPR_SHADOW_GPR(x) ((x) + SPR_GPR_BASE + 32) 


1l.mtspr r0,r5,SPR_SHADOW_GPR(5) 
handle exception 

l.mfspr r5,r0,SPR_SHADOW_GPR(5) 

l.rfe 


Note, if this method is used it means that fast context switching cannot be used 
with the multicore system. 


10.4Multicore bootstrapping 


When booting a multicore OpenRISC system, upon reset all cores will begin 
execution at the reset vector. It is recommended that the core 0, the primary core, 
performs all hardware initialization and signals the secondary cores to initialize. 


The secondary cores should wait to initialize until a signal is received from the 
primary core. Secondary cores can wait during to initialize by either spinning waiting for 
the initialization signal or by engaging the Power Management Doze mode and waiting 
for an interrupt. 


The initialization signal is typically a variable stored in memory. The variable 
initially will be 0, the primary core will set it to the id of the core to initialize signaling 
each core to boot one by one. 


10.5Timer Synchronization 


When running a multicore OpenRISC system it is typically useful for the Tick 
Timers of all cores to be synchronized. This is not guaranteed as processor reset and 
timer enablement may not have been triggered at the same time. To synchronize the Tick 
Timer it is recommended to either provide an external global timer device or use a 
software synchronization routine. 
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11 Debug Unit (Optional) 


This chapter describes the OpenRISC 1000 debug facility. The debug unit assists 
software developers in debugging their systems. It provides support for watchpoints, 
breakpoints and program-flow control registers. 


Watchpoints and breakpoint are events triggered by program- or data-flow 
matching the conditions programmed in the debug registers. Watchpoints do not interfere 
with the execution of the program-flow except indirectly when they cause a breakpoint. 
Watchpoints can be counted by Performance Counters Unit. 


Breakpoint, unlike watchpoints, also suspends execution of the current program- 
flow and start trap exception processing. Breakpoint is optional consequence of 
watchpoints. 


11.1Features 


The OpenRISC 1000 architecture defines eight sets of debug registers. Additional 
debug register sets can be defined by the implementation itself. The debug unit is optional 
and the presence of an implementation is indicated by the UPR[DUP] bit. 


v¥ Optional implementation 
v Eight architecture defined sets of debug value/compare registers 


v¥ Match signed/unsigned conditions on instruction fetch EA, load/store EA and 
load/store data 


¥ Combining match conditions for complex watchpoints 

v¥ Watchpoints can be counted by Performance Counters Unit 

v¥ Watchpoints can generate a breakpoint (trap exception) 

¥ Counting watchpoints for generation of additional watchpoints 


DVR/DCR pairs are used to compare instruction fetch or load/store EA and 
load/store data to the value stored in DVRs. Matches can be combined into more complex 
matches and used for generation of watchpoints. Watchpoints can be counted and 
reported as breakpoint. 
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DVRO/DCRO 


Watchpoints 


Breakpoints 






Data Cache 


Figure 11-1. Block Diagram of Debug Support 


11.2Debug Value Registers (DVRO-DVR7) 


The debug value registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

The DVRs are programmed with the watchpoint addresses or data by the resident 
debug software or by the development interface. Their value is compared to the fetch or 
load/store EA or to the load/store data according to the corresponding DCR. Based on the 
settings of the corresponding DCR a watchpoint is generated. 











Bit 31-0 
Identifier 
Reset 
R/W R/W 








VALUE Watchpoint/Breakpoint Address/Data 


Table 11-1. DVR Field Descriptions 








11.3Debug Control Registers (DCRO-DCR7) 
The debug control registers are 32-bit special-purpose supervisor-level registers 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


The DCRs are programmed with the watchpoint settings that define how DVRs are 
compared to the instruction fetch or load/store EA or to the load/store data. 
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Bit 31-8 7-5 4 3-1 0 
Identifier Reserved CT SC CC DP 
Reset Xx 0 0 0 0 

R/W 





























DP DVR/DCR Present 
0 Corresponding DVR/DCR pair is not present 
1 Corresponding DVR/DCR pair is present 


Compare Condition 
000 Masked 
001 Equal 
010 Less than 
011 Less than or equal 
100 Greater than 
101 Greater than or equal 
110 Not equal 
111 Reserved 


Signed Comparison 
0 Compare using unsigned integers 
1 Compare using signed integers 
Compare To 
000 Comparison disabled 
001 Instruction fetch EA 
010 Load EA 
011 Store EA 
100 Load data 
101 Store data 
110 Load/Store EA 
111 Load/Store data 


CC 





Sc 


CT 














Table 11-2. DCR Field Descriptions 


11.4Debug Mode Register 1 (DMR1) 

The debug mode register 1 is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

The DMR1 is programmed with the watchpoint/breakpoint settings that define 
how DVR/DCR pairs operate and is set by the resident debug software or by the 
development interface. 

















Bit 31-25 23 22 21-20 19-18 17-16 
Identifier Reserved BT ST Res Cw9 CW8 
Reset Xx 0 0 0 0 0 

R/W R R/W R/W R/W R/W R/W 
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Bit 


15-14 


13-12 


11-10 


9-8 7-6 5-4 


3-2 


1-0 





Identifier 


CW7 


CW6 


CW5 


CW4 CW3 CW2 


CWw1 


CWO 





Reset 
R/W 





R/W 





R/W 





R/W 


0 0 0 
R/W R/W R/W 














R/W 








R/W 











CWO 


Chain Watchpoint 0 


00 Watchpoint 0 = Match 0 
01 Watchpoint 0 = Match O & External Watchpoint 
10 Watchpoint 0 = Match 0 | External Watchpoint 


11 Reserved 





CW1 


Cw2 


CW3 


Cw4 


Chain Watchpoint 1 


00 Watchpoint 1 = Match 1 


01 Watc 


hpoint 1 = Match 1 & Watchpoint 0 


10 Watchpoint 1 = Match 1 | Watchpoint 0 


11 Reserved 
Chain Watchpoint 2 


00 Watchpoint 2 = Match 2 
01 Watchpoint 2 = Match 2 & Watchpoint 1 
10 Watchpoint 2 = Match 2 | Watchpoint 1 


11 Reserved 


Chain Watchpoint 3 
00 Watchpoint 3 = Match 3 


01 Watchpoint 3 = Match 3 & Watchpoint 2 
10 Watchpoint 3 = Match 3 | Watchpoint 2 


11 Reserved 


Chain Watchpoint 4 
00 Watchpoint 4 = Match 4 


01 Watchpoint 4 = Match 4 & External Watchpoint 
10 Watchpoint 4 = Match 4 | External Watchpoint 


11 Reserved 





CW5 


Chain Watchpoint 5 
00 Watchpoint 5 = Match 5 


01 Watchpoint 5 = Match 5 & Watchpoint 4 
10 Watchpoint 5 = Match 5 | Watchpoint 4 


11 Reserved 





CW6 


Chain Watchpoint 6 
00 Watchpoint 6 = Match 6 


01 Watchpoint 6 = Match 6 & Watchpoint 5 
10 Watchpoint 6 = Match 6 | Watchpoint 5 


11 Reserved 





CW7 
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Chain Watchpoint 7 
00 Watchpoint 7 = Match 7 


01 Watchpoint 7 = Match 7 & Watchpoint 6 


10 Wat 


chpoint 7 = Match 7 | Watchpoint 6 
11 Reserved 
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CW8 Chain Watchpoint 8 
00 Watchpoint 8 = Watchpoint counter 0 match 
01 Watchpoint 8 = Watchpoint counter 0 match & Watchpoint 3 
10 Watchpoint 8 = Watchpoint counter 0 match | Watchpoint 3 
11 Reserved 


Cw9 Chain Watchpoint 9 
00 Watchpoint 9 = Watchpoint counter 1 match 
01 Watchpoint 9 = Watchpoint counter 1 match & Watchpoint 7 
10 Watchpoint 9 = Watchpoint counter 1 match | Watchpoint 7 
11 Reserved 


ST Single-step Trace 
0 Single-step trace disabled 
1 Every executed instruction causes trap exception 
BT Branch Trace 
0 Branch trace disabled 
1 Every executed branch instruction causes trap exception 











Table 11-3. DMR1 Field Descriptions 


11.5Debug Mode Register 2(DMR2) 


The debug mode register 2 is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

The DMR2 is programmed with the watchpoint/breakpoint settings that define 
which watchpoints generate a breakpoint and which watchpoint counters are enabled. 
When a breakpoint happens WBS provides information which watchpoint or several 
watchpoints caused breakpoint condition. WBS bits are sticky and should be cleared by 
writing 0 ot them every time a breakpoint condition is processed. DMR2 is set by the 
resident debug software or by the development interface. 









































Bit 31-22 21-12 11-2 1 0 
Identifier WBS WGB AWTC WCE1 WCEO 
Reset 0 0 0 0 0 
R/W R R/W R/W R/IW R/IW 
WCEO Watchpoint Counter Enable 0 


O Counter O disabled 
1 Counter 0 enabled 





WCE1 Watchpoint Counter Enable 1 
O Counter 1 disabled 
1 Counter 1 enabled 
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AWTC Assign Watchpoints to Counter 
00 0000 0000 All Watchpoints increment counter 0 


00 0000 0001 Watchpoint 0 increments counter 1 


00 0000 1111 First four watchpoints increment counter 1, rest increment 
counter 0 


11 1111 1111 All watchpoints increment counter 1 


WGB Watchpoints Generating Breakpoint (trap exception) 
00 0000 0000 Breakpoint disabled 


00 0000 0001 Watchpoint 0 generates breakpoint 
01 0000 0000 Watchpoint counter 0 generates breakpoint 


11 1111 1111 All watchpoints generate breakpoint 
Watchpoints Breakpoint Status 
00 0000 0000 No watchpoint caused breakpoint 
00 0000 0001 Watchpoint 0 caused breakpoint 





WBS 


01 0000 0000 Watchpoint counter 0 caused breakpoint 











11 1111 1111 Any watchpoint could have caused breakpoint 
Table 11-4. DMR2 Field Descriptions 


11.6Debug Watchpoint Counter Register (DWCRO- 
DWCR1) 


The debug watchpoint counter registers are 32-bit special-purpose supervisor-level 
registers accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


The DWCRs contain 16-bit counters that count watchpoints programmed in the 
DMR. The value in a DWCR can be accessed by the resident debug software or by the 
development interface. DWCRs also contain match values. When a counter reaches the 
match value, a watchpoint is generated. 











Bit 15-0 
Identifier COUNT 
Reset 0 
R/W R/W 











COUNT Number of watchpoints programmed in DMR 
N 16-bit counter of generated watchpoints assigned to this counter 
MATCH N 16-bit value that when matched generates a watchpoint 


Table 11-5. DWCR Field Descriptions 
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11.7Debug Stop Register (DSR) 


The debug stop register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 
The DSR specifies which exceptions cause the core to stop the execution of the 
exception handler and turn over control to development interface. It can be programmed 
by the resident debug software or by the development interface. 































































































Bit 31-14 | 42 | 10 9 8 
Identifier Reserved TE FPE SCE RE IME DME 
Reset 0 0 0 
R/W R/W R/W R/W 
Bit 7 6 5 | 4 3 2 1 0 
Identifier INTE NE AE | TTE IPFE DPFE | BUSEE | RSTE 
Reset 0 0 0 | oO 0 0 0 0 

R/W R/W R/W R/W | R/W R/W R/W R/IW R/W 
RSTE Reset Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
BUSEE Bus Error Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
DPFE Data Page Fault Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
IPFE Instruction Page Fault Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
TTE Tick Timer Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
AE Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
NE Illegal Instruction Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
INTE Interrupt Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
DME DTLB Miss Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
www.openrisc.io 1.2-1 305 of 357 














OpenCores 


IME 


OpenRISC 1000 Architecture Manual August 20, 2017 


ITLB Miss Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 





RE 


SCE 


FPE 


Range Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 


System Call Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 
Floating Point Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 





TE 





Trap Exception 
0 This exception does not transfer control to the development I/F 
1 This exception transfers control to the development interface 





Table 11-6. DSR Field Descriptions 


11.8Debug Reason Register (DRR) 


The debug reason register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

The DRR specifies which event caused the core to stop the execution of program 
flow and turned control over to the development interface. It should be cleared by the 
resident debug software or by the development interface. 
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Bit 31-14 13 12 11 10 9 8 
Identifier Reserved TE FPE SCE RE IME DME 
Reset 0 0 0 0 0 0 
R/W R/W R/W R/W R/W R/W R/W 

Bit 2 1 0 
Identifier DPFE | BUSEE |} RSTE 
Reset 0 0 0 
R/W R/W R/W R/W 
RSTE Reset Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
BUSEE Bus Error Exception 
O This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
DPFE Data Page Fault Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
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IPFE Instruction Page Fault Exception 
O This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
TTE Tick Timer Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
AE Alignment Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 





NE Illegal Instruction Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
INTE Interrupt Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
DME DTLB Miss Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
IME ITLB Miss Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 








RE Range Exception 
O This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 


SCE System Call Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transfered control to the development interface 
FPE Floating Point Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transferred control to the development interface 





TE Trap Exception 
0 This exception did not transfer control to the development I/F 
1 This exception transferred control to the development interface 








Table 11-7. DRR Field Descriptions 
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12 Performance Counters Unit 
(Optional) 


This chapter describes the OpenRISC 1000 performance counters facility. 
Performance counters can be used to count predefined events such as L1 instruction or 
data cache misses, branch instructions, pipeline stalls etc. 


Data from the Performance Counters Unit can be used for the following: 


¥ To improve performance by developing better application level algorithms, better 
optimized operating system routines and for improvements in the hardware architecture 
of these systems (e.g. memory subsystems). 


¥ To improve future OpenRISC implementations and add future enhancements to the 
OpenRISC architecture. 


¥ To help system developers debug and test their systems. 


12.1Features 


The OpenRISC 1000 architecture defines eight performance counters. Additional 
performance counters can be defined by the implementation itself. The Performance 
Counters Unit is optional and the presence of an implementation is indicated by the 
UPR[PCUP] bit. 


¥ Optional implementation. 

v Eight architecture defined performance counters 
v Eight custom performance counters 

v¥ Programmable counting conditions. 


12.2Performance Counters Count Registers 
(PCCRO-PCCR7) 


The performance counters count registers are 32-bit special-purpose supervisor- 
level registers accessible with the |.mtspr/l.mfspr instructions in supervisor mode. Read 
access in user mode is possible, if it is enabled in SR[ISUMRA]. 


They are counters of the events programmed in the PCMR registers. 














Bit 31-0 
Identifier COUNT 
Reset 0 
R/W R/W 




















COUNT Event counter 


Table 12-1. PCCRO Field Descriptions 
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12.3Performance Counters Mode Registers 
(PCMRO-PCMR7) 

The performance counters mode registers are 32-bit special-purpose supervisor- 
level registers accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

They define which events the performance counters unit counts. 


























































































































Bit 31-26 25-15 14 13 12 11 10 
Identifier Reserved WPE DDS | ITLBM | DTLBM BS LSUS 
Reset Xx 0 0 0 0 0 0 
R/W Read Only R/W R/W R/W R/W R/W R/W 

Bit 6 5 4 3 2 1 0 
Identifier IF SA LA CIUM | CISM | Rese | CP 
rved 
Reset 0 0 0 0 0 0 1 
R/W R/W | R/W R/W R/W R/W R/W R 
CP Counter Present 
O Counter not present 
1 Counter present 
CISM Count in Supervisor Mode 
O Counter disabled in supervisor mode 
1 Counter counts events in supervisor mode 
CIUM Count in User Mode 
O Counter disabled in user mode 
1 Counter counts events in user mode 
LA Load Access event 
O Event ignored 
1 Count load accesses 
SA Store Access event 
O Event ignored 
1 Count store accesses 
IF Instruction Fetch event 
O Event ignored 
1 Count instruction fetches 
DCM Data Cache Miss event 
O Event ignored 
1 Count data cache missed 
ICM Instruction Cache Miss event 
0 Event ignored 
1 Count instruction cache misses 
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Instruction Fetch Stall event 
O Event ignored 
1 Count instruction fetch stalls 
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LSUS 


BS 


DTLBM 


LSU Stall event 
O Event ignored 
1 Count LSU stalls 
Branch Stalls event 
O Event ignored 
1 Count branch stalls 
DTLB Miss event 
0 Event ignored 
1 Count DTLB misses 





ITLBM 


DDS 


WPE 
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ITLB Miss event 
0 Event ignored 
1 Count ITLB misses 


Data Dependency Stalls event 
0 Event ignored 
1 Count data dependency stalls 


Watchpoint Events 
000 0000 0000 All watchpoint events ignored 
000 0000 0001 Watchpoint 0 counted 


111 1111 1111 All watchpoints counted 
Table 12-2. PCMR Field Descriptions 
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13 Power Management (Optional) 


This chapter describes the OpenRISC 1000 power management facility. The power 
management facility is optional and implementation may choose which features to 
implement, and which not. UPR[PMP] indicates whether power management is 
implemented or not. 


Note that this chapter describes the architectural control of power management 
from the perspective of the programming model. As such, it does not describe technology 
specific optimizations or implementation techniques. 


13.1Features 


The OpenRISC 1000 architecture defines five architectural features for minimizing 
power consumption: 


v slow down feature 

doze mode 

sleep mode 

suspend mode 

dynamic clock gating feature 


aN 


The slow down feature takes advantage of the low-power dividers in external 
clock generation circuitry to enable full functionality, but at a lower frequency so that 
power consumption is reduced. 


The slow down feature is software controlled with the 4-bit value in PMR[SDF]. A 
lower value specifies higher expected performance from the processor core. Whether this 
value controls a processor clock frequency or some other implementation specific feature 
is irrelevant to the controlling software. Usually PMR[SDF] is dynamically set by the 
operating system’s idle routine, that monitors the usage of the processor core. 


When software initiates the doze mode, software processing on the core suspends. 
The clocks to the processor internal units are disabled except to the internal tick timer and 
programmable interrupt controller. However other on-chip blocks (outside of the 
processor block) can continue to function as normal. 

The processor should leave doze mode and enter normal mode when a pending 
interrupt occurs. 

In sleep mode, all processor internal units are disabled and clocks gated. 
Optionally, an implementation may choose to lower the operating voltage of the 
processor core. 

The processor should leave sleep mode and enter normal mode when a pending 
interrupt occurs. 

In suspend mode, all processor internal units are disabled and clocks gated. 
Optionally, an implementation may choose to lower the operating voltage of the 
processor core. 
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The processor enters normal mode when it is reset. Software may implement a 
reset exception handler that refreshes system memory and updates the RISC with the 
state prior to the suspension. 


If enabled, the clock-gating feature automatically disables clock subtrees to major 
processor internal units on a clock cycle basis. These blocks are usually the CPU, 
FPU/VU, IC, DC, IMMU and DMMU. This feature can be used in a combination with 
other power management features and low-power modes. 

Cache or MMU blocks that are already disabled when software enables this 
feature, have completely disabled clock subtrees until clock gating is disabled or until the 
blocks are again enabled. 


13.2Power Management Register (PMR) 


The power management register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


PMR is used to enable or disable power management features and modes. 

































































Bit 31-7 7 6 5 4 3-0 
Identifier Reserved SUME DCGE SME DME SDF 
Reset X 0 0 0 0 0 
R/W R R/W R/W R/W R/W R/W 
SDF Slow Down Factor 
0 Full speed 
1-15 Logarithmic clock frequency reduction 
DME Doze Mode Enable 
0 Doze mode not enabled 
1 Doze mode enabled 
SME Sleep Mode Enable 
0 Sleep mode not enabled 
1 Sleep mode enabled 
DCGE Dynamic Clock Gating Enable 
0 Dynamic clock gating not enabled 
1 Dynamic clock gating enabled 
SUME Suspend Mode Enable 
0 Suspend mode not enabled 
1 Suspend mode enabled 
Table 13-1. PMR Field Descriptions 
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14 Programmable Interrupt 
Controller (Optional) 


This chapter describes the OpenRISC 1000 level one programmable interrupt 
controller. The interrupt controller facility is optional and an implementation may chose 
whether or not to implement it. If it is not implemented, interrupt input is directly 
connected to interrupt exception inputs. UPR[PICP] specifies whether the programmable 
interrupt controller is implemented or not. 


The Programmable Interrupt Controller has two special-purpose registers and 32 
maskable interrupt inputs. If implementation requires permanent unmasked interrupt 
inputs, it can use interrupt inputs [1:0] and PICMR[1:0] should be fixed to one. 


14.1Features 


The OpenRISC 1000 architecture defines an interrupt controller facility with up to 
32 interrupt inputs: 





PICMR 








> Mask Function =—~——————— 
INT [31:0] EXT INT EXCEPTION 




















a. PICSR 











Figure 14-1. Programmable Interrupt Controller Block Diagram 


14.2PIC Mask Register (PICMR) 


The interrupt controller mask register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


PICMR is used to mask or unmask 32 programmable interrupt sources. 
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Bit 31-0 
Identifier IUM 
Reset 0 
R/W R/W 
IUM Interrupt UnMask 


0x00000000 All interrupts are masked 
0x00000001 Interrupt input 0 is enabled, all others are masked 











OxFFFFFFFF All interrupt inputs are enabled 





Table 14-1. PICMR Field Descriptions 


14.3PIC Status Register (PICSR) 


The interrupt controller status register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


PICSR is used to determine the status of each PIC interrupt input. PIC can support 
level-triggered interrupts or combination of level-triggered and edge-triggered. Most 
implementations today only support level-triggered interrupts. 

For level-triggered implementations bits in PICSR simply represent level of 
interrupt inputs. Interrupts are cleared by taking appropriate action at the device to negate 
the source of the interrupt.Writing a '1' or a '0' to bits in the PICSR that reflect a level- 
triggered source must have no effect on PICSR content. 


The atomic way to clear an interrupt source which is edge-triggered is by writing a 
'l' to the corresponding bit in the PICSR. This will clear the underlying latch for the 
edge-triggered source. Writing a '0' to the corresponding bit in the PICSR has no effect on 
the underlying latch. 




















Bit 31-0 
Identifier IS 
Reset 0 
R/W R/(W*) 











IS Interrupt Status 
0x00000000 All interrupts are inactive 
0x00000001 Interrupt input 0 is pending 








OxFFFFFFFF All interrupts are pending 





Table 14-2. PICSR Field Descriptions 
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15 Tick Timer Facility (Optional) 


This chapter describes the OpenRISC 1000 tick timer facility. It is optional and an 
implementation may chose whether or not to implement it. UPR[TTP] specifies whether 
or not the tick timer facility is present. 


The Tick Timer is used to schedule operating system and user tasks on regular time 
basis or as a high precision time reference. 


The Tick Timer facility is enabled with TTMR[M]. TTCR is incremented with 
each clock cycle and a tick timer interrupt can be asserted whenever the lower 28 bits of 
TTCR match TTMR[TP] and TTMR[IE] is set. 


TTCR restarts counting from zero when a match event happens and TTMR[M] is 
0x1. If TTMR[M] is 0x2, TTCR is stoped when match event happens and TTCR must be 
changed to start counting again. When TTMR[M] is 0x3, TTCR keeps counting even 
when match event happens. 


15.1Features 


The OpenRISC 1000 architecture defines a tick timer facility with the following 
features: 


¥ Maximum timer count of 2/32 clock cycles 

¥ Maximum time period of 2428 clock cycles between interrupts 
¥ Maskable tick timer interrupt 

v Single run, restartable counter, or continues counter 





TTMR 




















TICK INT 
» TTCR |____+] 





RISC clk 











Figure 15-1. Tick Timer Block Diagram 


15.2Timer interrupts 


A timer interrupt will happen everytime TTMR[IE] bit is set and TTMR[TP] 
matches the lower 28-bits of the TTCR SPR, the top 4 bits are ignored for the 
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comparison. When an interrupt is pending the TTMR[IP] bit will be set and the interrupt 
will be asserted to the cpu core until it is cleared by writting a 0 to the TTMR[IP] bit. 
However, if the TTMR[IE] bit was not set when a match condition occured no interrupt 
will be asserted and the TTMRI[IP] bit won't be set unless it has not been cleared from a 
previous interrupt. The TTMR[IE] bit is not meant as a mask bit, SR[TEE] is provided 
for that purpose. 


15.3Timer modes 


It is up to the programmer to ensure that the TTCR SPR is set to a sane value 
before the timer mode is programmed. When the timing mode is programmed into the 
timer by setting TTMR[M], the TTCR SPR is not preset to any predefined value, 
including 0. If the lower 28-bits of the TTCR SPR is numerically greater than what was 
programmed into TTMR[TP] then the timer will only assert the timer interrupt when the 
lower 28-bits of the TTCR SPR have wrapped around to 0 and counted up to the match 
value programmed into TTMR[TP]. 


15.3.1 Disabled timer 


In this mode the timer does not increment the TTCR spr. Though note that the timer 
interrupt is independent from the timer mode and as such the timer interrupt is not 
disabled when the timer is disabled. 


15.3.2 Auto-restart timer 


When the timer is set to auto-restart mode, the timer will reset the TTCR spr to 0 as soon 
as the lower 28-bits of the TTCR spr match TTMR[TP] and the timer interrupt will be 
asserted to the cpu core if the TTMRI[IE] bit has been set. 


15.3.3 One-shot timer 


In one-shot timeing mode, the timer stops counting as soon as a match condition has been 
reached. Although the timer has in effect been disabled (and can't be restarted by writting 
to the TTCR spr) the TTMR[M] bits shall still indicate that the timer is in one-shot mode 
and not that it has been disabled. Care should be taken that the timer interrupt has been 
masked (or disabled) after the match condition has been reached, or else the cpu core will 
get a spurious timer interrupt. 


15.3.4 Continuous timer 


In the event that a match condition has been reached, the counter does not stop but rather 
keeps counting from the value of the TTCR spr and the timer interrupt will be asserted if 
the TTMR[IE] bit has been set. 
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15.4Tick Timer Mode Register (TTMR) 


The tick timer mode register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

The TTMR is programmed with the time period of the tick timer as well as with 
the mode bits that control operation of the tick timer. 


























Bit 31-30] 29 | 28 | 27-0 
Identifier M IE IP TP 
Reset x 
R/IW RW 

TP Time Period 


0x0000000 Shortest comparison time period 


OxFFFFFFF Longest comparison time period 


IP Interrupt Pending 
O Tick timer interrupt is not pending 
1 Tick timer interrupt pending (write ‘0’ to clear it) 
IE Interrupt Enable 
O Tick timer does not generate tick timer interrupt 
1 Tick timer generates tick timer interrupt when TTMR[TP] matches TTCR[27:0] 


M Mode 
00 Tick timer is disabled 
01 Timer is restarted when TTMR[TP] matches TTCR[27:0] 


10 Timer stops when TTMR[TP] matches TTCR[27:0] (change TTCR to resume 
counting) 


11 Timer does not stop when TTMR[TP] matches TTCR[27:0] 

















Table 15-1. TTMR Field Descriptions 


15.5Tick Timer Count Register (TTCR) 


The tick timer count register is a 32-bit special-purpose register accessible with the 
l.mtspr/l.mfspr instructions in supervisor mode and as read-only register in user mode if 
enabled in SR[SUMRA]. 

TTCR holds the current value of the timer. 











Bit 31-0 
Identifier CNT 
Reset 0 

R/W R/W 
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CNT Count 
32-bit incrementing counter 





Table 15-2. TTCR Field Descriptions 
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16 OpenRISC 1000 
Implementations 


16.10verview 
Implementations of the OpenRISC 1000 architecture come in different 
configurations and version releases. 


Version and unit present registers both identify the model, version and its 
configuration. Detailed configuration for some units is available in configuration 
registers. 


An operating system can read VR, UPR and the configuration registers, and adjust 
its own operation if required. Operating systems ported on a particular OpenRISC version 
should run on different configurations of this version without modifications. 


16.2Version Register (VR) 


The version register is a 32-bit special-purpose supervisor-level register accessible 
with the |.mtspr/l.mfspr instructions in supervisor mode. 

It identifies the version (model) and revision level of the OpenRISC 1000 
processor. It also specifies the possible template on which this implementation is based. 


This register is deprecated, and the AVR and VR2 SPR should be used to 
determine more accurately the version information. 












































Bit | 31- 23-16 15-7 6 5-0 
24 
Iden) VE CFG Reserved UVRP REV 
tifier | R 
Res - - X - - 
et 
RIW| R R R R R 
REV Revision 
0..63 A 6-bit number that identifies various releases of a particular version. This 
number is changed for each revision of the device. 
UVRP Updated Version Registers Present 
A bit indicating that the AVR and VR2 SPRs are available and should be used to 
determine version information. 
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Configuration Template 
0..99 An 8-bit number that identifies particular configuration. However this is just 
for operating systems that do not use information provided by configuration 
registers and thus are not truly portable across different configurations of one 
implementation version. 
Configurations that do implement configuration registers must have their CFG 
smaller than 50 and configurations that do not implement configuration registers 
must have their CFG 50 or bigger. 


Version 
0x10..0x19 An 8-bit number that identifies a particular processor version and 
version of the OpenRISC architecture. Values below 0x10 and above 0x19 are 
illegal for OpenRISC 1000 processor implementations. 








Table 16-1. VR Field Descriptions 


16.3Unit Present Register (UPR) 


The unit present register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 

It identifies the present units in the processor. It has a bit for each possible unit or 
functionality. The lower sixteen bits identify the presence of units defined in the 
OpenRISC 1000 architecture. The upper sixteen bits define the presence of custom units. 












































Bit 31-24 9 8 7 
Identifier CUP PMP PICP PCUP 
Reset - - - - 
R/W R R R R 
Bit 6 5 4 2 1 0 
Identifier DUP MP IMP ICP DCP UP 
Reset 

































UP 


DCP 


ICP 


UPR Present 
O UPR is not present 
1 UPR is present 


Data Cache Present 
O Unit is not present 
1 Unit is present 
Instruction Cache Present 
O Unit is not present 
1 Unit is present 





DMP 
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Data MMU Present 
O Unit is not present 
1 Unit is present 
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IMP Instruction MMU Present 
O Unit is not present 
1 Unit is present 


MP MAC Present 
O Unit is not present 
1 Unit is present 


DUP Debug Unit Present 
O Unit is not present 
1 Unit is present 


PCUP Performance Counters Unit Present 
O Unit is not present 
1 Unit is present 


PMP Power Management Present 
O Unit is not present 
1 Unit is present 
PICP Programmable Interrupt Controller Present 
O Unit is not present 
1 Unit is present 
TTP Tick Timer Present 
O Unit is not present 
1 Unit is present 


CUP Custom Units Present 























Table 16-2. UPR Field Descriptions 


16.4CPU Configuration Register (CPUCFGR) 


The CPU configuration register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


It specifies CPU capabilities and configuration. 








































































31-14 
Identifi Reserved AECSRP 
er 
Reset - - - - - - 
R/W R R R R R R 
Bit 9 8 4 3-0 
Identifier OV64S OF64S CGF NSGF 
Reset - - - - - - - 
R/W R R R R R R R 

















NSGF Number of Shadow GPR Files 
O Zero shadow GPR files 
15 Fifteen shadow GPR Files 
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Custom GPR File 
O GPR file has 32 registers 
1 GPR file has less than 32 registers 





OB32S 


OB64S 


OF32S 


ORBIS32 Supported 
O Not supported 
1 Supported 
ORBIS64 Supported 
O Not supported 
1 Supported 
ORFPX32 Supported 
O Not supported 
1 Supported 





OF64S 


OV64S 


ND 


AVRP 


ORFPX64 Supported 
O Not supported 
1 Supported 


ORVDX64 Supported 
O Not supported 
1 Supported 


No Delay-Slot 


0 CPU executes delay slot of jump/branch instructions before taking 
jump/branch 


1 CPU does not execute instructions in delay slot if taking jump/branch 
Architecture Version Register (AVR) Present 
0 AVR not present 
1 AVR present 





EVBARP 


ISRP 


AECSRP 








Exception Vector Base Address Register (EVBAR) Present 
0 EVBAR not present 
1 EVBAR present 


Implementation-Specific Registers (ISRO-7) Preset 
0 ISRs not present 
1 ISRs present 
Arithmetic Exception Control Register (AECR) and Arithmetic Exception Status 
Register (AESR) present 
0 AECR and AESR not present 
1 AECR and AESR present 





Table 16-3. CPUCFGR Field Descriptions 


16.5DMMU Configuration Register (DMMUCFGR) 


The DMMU configuration register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


It specifies the DMMU capabilities and configuration. 
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Bit 31-12 
Identifier Reserved 
Reset - 
R/W R 



















Identifier 
Reset - - - 
R/W R R R 
































NTW Number of TLB Ways 
0 DTLB has one way 


3 DTLB has four ways 


NTS Number of TLB Sets (entries per way) 
O DTLB has one set (entries per way) 





7 DTLB has 128 sets (entries per way) 


NAE Number of ATB Entries 
O DATB does not exist 
1 DATB has one entry 


4 DATB has four entries 
5..7 Invalid values 


CRI Control Register Implemented 
0 DMMUCR not implemented 
1 DMMUCR implemented 


PRI Protection Register Implemented 
0 DMMUPR not implemented 
1 DMMUPR implemented 
TEIRI TLB Entry Invalidate Register Implemented 
0 DTLBEIR not implemented 
1 DTLBEIR implemented 
HTR Hardware TLB Reload 
0 TLB Entry reloaded in software 
1 TLB Entry reloaded in hardware 




















Table 16-4. DMMUCFGR Field Descriptions 


16.6IMMU Configuration Register (IMMUCFGR) 


The IMMU configuration register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


It specifies IMMU capabilities and configuration. 
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Bit 31-12 
Identifier Reserved 
Reset - 
R/W R 














Identifier 




































0 ITLB has one set (entries per way) 


7 ITLB has 128 sets (entries per way) 
NAE Number of ATB Entries 
0 IATB does not exist 
1 IATB has one entry 


4 IATB has four entries 
5..7 Invalid values 

CRI Control Register Implemented 

0 IMMUCR not implemented 
1 IMMUCR implemented 


Reset - - - - - - 
R/W R R R R R R 
NTW Number of TLB Ways 

0 ITLB has one way 

3 ITLB has four ways 
NTS Number of TLB Sets (entries per way) 





PRI Protection Register Implemented 
0 IMMUPR not implemented 
1 IMMUPR implemented 
TEIRI TLB Entry Invalidate Register Implemented 
0 ITLBEIR not implemented 
1 ITLBEIR implemented 
HTR Hardware TLB Reload 
0 ITLB Entry reloaded in software 
1 ITLB Entry reloaded in hardware 

















Table 16-5. IMMUCFGR Field Descriptions 


16.7DC Configuration Register (DCCFGR) 


The DC configuration register is a 32-bit special-purpose supervisor-level register 


accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 
It specifies data cache capabilities and configuration. 
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Bit 31-15 14 13 12 
Identifier Reserved CBWBRI CBFRI CBLRI 
Reset - : - - 
R/W R R R R 
Identifier 
Reset - - = - - 
R/W R R R R R 
NCW Number of Cache Ways 
0 DC has one way 
5 DC has thirty-two ways 
NCS Number of Cache Sets (cache blocks per way) 
0 DC has one set (cache blocks per way) 
10 DC has 1024 sets (cache blocks per way) 
BS Cache Block Size 
0 Cache block size 16 bytes 
1 Cache block size 32 bytes 
CWS Cache Write Strategy 
0 Cache write-through 
1 Cache write-back 
CCRI Cache Control Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBIRI Cache Block Invalidate Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBPRI Cache Block Prefetch Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBLRI Cache Block Lock Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBFRI Cache Block Flush Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBWBRI Cache Block Write-Back Register Implemented 
O Register is not implemented 
1 Register is implemented 
Table 16-6. DCCFGR Field Descriptions 
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16.8I1C Configuration Register (ICCFGR) 


The IC configuration register is a 32-bit special-purpose supervisor-level register 
accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


It specifies instruction cache capabilities and configuration. 


































Bit 31-13 12 
Identifier Reserved CBLRI 
Reset - - 
R/W R R 
Bit 11 10 9 7 6-3 2-0 
Identifier CBPRI CBIRI CCRI CBS NCS NCW 
Reset 

































NCW Number of Cache Ways 
0 IC has one way 


5 IC has thirty-two ways 


NCS Number of Cache Sets (cache blocks per way) 
0 IC has one set (cache blocks per way) 


10 IC has 1024 sets (cache blocks per way) 


BS Cache Block Size 
0 Cache block size 16 bytes 
1 Cache block size 32 bytes 
CCRI Cache Control Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBIRI Cache Block Invalidate Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBPRI Cache Block Prefetch Register Implemented 
O Register is not implemented 
1 Register is implemented 
CBLRI Cache Block Lock Register Implemented 
O Register is not implemented 
1 Register is implemented 


























Table 16-7. ICCFGR Field Descriptions 
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16.9Debug Configuration Register (DCFGR) 


The debug configuration register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mtspr/l.mfspr instructions in supervisor mode. 


It specifies debug unit capabilities and configuration. 























Bit 2-0 
Identifier NDP 
Reset 7 
R/W R 

NDP Number of Debug Pairs 


0 Debug unit has one DCR/DVR pair 


7 Debug unit has eight DCR/DVR pairs 

WPCI Watchpoint Counters Implemented 

0 Watchpoint counters not implemented 
1 Watchpoint counters implemented 














Table 16-8. DCFGR Field Descriptions 


16.10 Performance Counters Configuration 
Register (PCCFGR) 


The performance counters configuration register is a 32-bit special-purpose 
supervisor-level register accessible with the |.mtspr/l.mfspr instructions in supervisor 
mode. 

It specifies performance counters unit capabilities and configuration. 























Bit 31-3 2-0 
Identifier Reserved NPC 
Reset : - 
R/W R R 

NPC Number of Performance Counters 


O One performance counter 


7 Eight performance counters 











Table 16-9. PCCFGR Field Descriptions 
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16.11 Version Register 2 (VR2) 


The version register 2 is a 32-bit special-purpose supervisor-level register 
accessible with the |.mfspr instruction in supervisor mode. 

It holds implementation-specific version information. It is intended to replace the 
VR register. 

The value in the CPUID field should correspond to an implementation list held on 
the site which hosts this document. It is most likely that a master list will also be 
maintained at OpenCores.org. 

Its presence is indicated by the UVRP bit in the Version Register (VR). 



































Bit 31-24 23-0 
Identifier CPUID VER 
Reset - - 
R/W R R 
CPUID CPU Identification Number 


Implementation-specific identification number. Each implementation should have 
a unique identification number. 
VER Version 
Implementation-specific version number. This field, if interpreted as an unsigned 
24-bit number, should increase for each new version. The implementation 
reference manual should document the meaning of this value. 











Table 16-10. VR2 Field Descriptions 


16.12 Architecture Version Register (AVR) 


The architecture version register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mfspr instruction in supervisor mode. 

It indicates the most recent version the implementation contains features from .The 
implementation must at least implement an accurate set of feature-presence bits in the 
appropriate registers according to that version of the architecture spec, so the presence of 
each of that version's features can be checked. Its presence is indicated by the AVRP bit 
in the CPU Configuration Register (CPUCFGR). 












































Bit 31-24 23-16 15-8 7-0 
Identifier MAJ MIN REV Reserved 
Reset - - - - 
R/W R R R R 
MAJ Major Architecture Version Number 
MIN Minor Architecture Version Number 
REV Architecture Revision Number 
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Exception Vector Base Address Register 


The architecture version register is a 32-bit special-purpose supervisor-level 
register accessible with the |.mfspr/ l.mtspr instructions in supervisor mode. 


This optional register can be used to apply an offset to the exception vector 


addresses. 
Register (CPUCFGR). 


Its presence is indicated by the EVBARP bit in the CPU Configuration 


If SR[EPH] is set, this value is logically ORed with the offset that provides. 



































16.14 
(AECR) 





Table 16-12. EVBAR Field Descriptions 


Bit 31-13 12-0 
Identifier EVBA Reserved 
Reset - - 
R/W R/W R 
EVBA Exception Vector Base Address 
Location for the start of exception hens Its reset value is implementation- 





Arithmetic Exception Control Register 


The arithmetic exception control register is a 32-bit special-purpose supervisor- 
level register accessible with the |.mfspr/ |.mtspr instructions in supervisor mode. 

This optional register can be used for fine-grained control over which arithmetic 
operations trigger overflow exceptions when the OVE bit is set in the Supervision 
Register (SR). Its presence is indicated by the AECSRP bit in the CPU Configuration 












































Register (CPUCFGR). 
Bit 31-7 6 5 
identifier Reserved OVMACADDE | CYMACADDE 
Reset - 0 6) 
RW R RW RW 
Bit 4 3 2 | 1 | 0 
identifier DBZE OVMULE | CYMULE | OVADDE | CYADDE 
Reset 0 0 
RW 







































CYADDE 
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OVADDE Overflow on Add Exception 
Overflow flag set by signed overflow on integer addition and subtraction 
instructions causes exception 
CYMULE Carry on Multiply Exception 
Carry flag set by unsigned overflow on integer multiplication instructions 
causes exception 
OVMULE Overflow on Multiply Exception 
Overflow flag set by signed overflow on integer multiplication instructions 
causes exception 
DBZE Divide By Zero Exception 
Overflow flag set by divide-by-zero on integer division instruction, or carry 
flag set by divide-by-zero on I.divu instruction, causes exception 
CYMACADDE Carry on MAC Addition Exception 
Carry flag set by unsigned overflow on integer addition stage of MAC 
instructions causes exception 
OVMACADDE Overflow on MAC Addition Exception 


Overflow flag set by signed overflow on integer addition stage of MAC 
instructions causes exception 














Table 16-13. EACR Field Descriptions 


16.15 Arithmetic Exception Status Register 
(AESR) 


The arithmetic exception status register is a 32-bit special-purpose supervisor-level 
register accessible with the l.mfspr/l.mtspr instructions in supervisor mode. 


This optional register indicates which arithmetic operations triggered an exception. 
The exceptions are triggered when the OVE bit is set in the Supervision Register (SR), 
and the overflow or carry flag is set according to any conditions with the corresponding 
bit set in the Arithmetic Exception Control Register (AECR). 


This register will indicate which condition in the Arithmetic Exception Control 
Register (AECR) caused the exception by setting the corresponding bit. The bits can be 
cleared by writing '0' to them. The exception will occur due to the arithmetic operation, 
not due to the flags in this register being set, so failing to clear the flag before returning 
from exception with SR[CY] or SR[OV] set will not cause another exception.. 


Its presence is indicated by the AECSRP bit in the CPU Configuration Register 
(CPUCFGR). 



































Bit 31-7 6 5 
Identifier Reserved OVMACADDE | CYMACADDE 
Reset - 0 0 
RW R R/IW RW 
Bit 4 3 5 | rt | 0 
Identifier DBZE 
Reset 0 
RW RW 
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CYADDE Carry on Add Exception 
Carry flag set by unsigned overflow on integer addition and subtraction 
instructions caused exception 
OVADDE Overflow on Add Exception 
Overflow flag set by signed overflow on integer addition and subtraction 
instructions caused exception 
CYMULE Carry on Multiply Exception 
Carry flag set by unsigned overflow on integer multiplication instructions 
caused exception 
OVMULE Overflow on Multiply Exception 
Overflow flag set by signed overflow on integer multiplication instructions 
caused exception 
DBZE Divide By Zero Exception 
Overflow flag set by divide-by-zero on integer division instruction, or carry 
flag set by divide-by-zero on I.divu instruction, caused exception 
CYMACADDE Carry on MAC Addition Exception 
Carry flag set by unsigned overflow on integer addition stage of MAC 
instructions caused exception 
OVMACADDE Overflow on MAC Addition Exception 


Overflow flag set by signed overflow on integer addition stage of MAC 
instructions caused exception 




















Table 16-14. EASR Field Descriptions 


16.16 Implementation-Specific Registers (ISRO- 
’) 
The implementation-specific registers are 32-bit special-purpose supervisor-level 
register accessible with the |.mfspr instruction in supervisor mode. 


They are SPR space which can be used by implementations for any purpose. Their 
presence is indicated by the ISRP bit in the CPU Configuration Register (CPUCFGR). 
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17 Application Binary Interface 


The ABI is currently defined only for 32-bit OpenRISC. When a toolchain is developed 
for 64-bit, this section will need updating. 


17.1Data Representation 


17.1.1 Fundamental Types 


Scalar types in the ISO/ANSI C language are based on memory operands 
definitions from the chapter entitled “Addressing Modes and Operand Conventions” on 
page 20. Similar relations between architecture and language types can be used for any 
other language. 












































Type CTYPE SIZEOF ALIGNMENT OPENRISC 
(BYTES) EQUIVALENT 
char 1 1 Signed byte 
signed char 
unsigned char Unsigned byte 
short Signed halfword 
signed short 
unsigned short 2 2 Unsigned halfword 
int 4 4 Signed singleword 
Integral signed int 
long 
signed long 
enum 
unsigned int 4 4 Unsigned singleword 
long long 8 4 Signed doubleword 
signed long long 
unsigned long long 8 4 Unsigned doubleword 
: Any-type * 4 4 Unsigned singleword 
Pointer Any-type (*) () 
Floating- float 4 4 Single precision float 
point double 8 4 Double precision float 








Table 17-1. Scalar Types 


Prior versions of this table specified a native 8-byte alignment for 8-byte values. 
Since current OR1200 implementation never required this, and the compiler did not 
implement it, the specification has changed to match the 32-bit OpenRISC platform in 


use. 


A null pointer of any type must be zero. All floating-point types are IEEE-754 


compliant. 
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The OpenRISC programming model introduces a set of fundamental vector data 
types, as described by Table 17-2. For vector assignments both sides of an assignment 
must be of the same vector type. 





























VECTOR TYPE SIZEOF ALIGNMENT OPENRISC EQUIVALENT 
(BYTES) 
Vector char 8 8 Vector of signed bytes 
Vector signed char 
Vector unsigned char 8 8 Vector of unsigned bytes 
Vector short 8 8 Vector of signed halfwords 
Vector signed short 
Vector unsigned short 8 8 Vector of unsigned halfwords 
Vector int 8 8 Vector of signed singlewords 
Vector signed int 
Vector long 
Vector signed long 
Vector unsigned int 8 8 Vector of unsigned singlewords 
Vector float 8 8 Vector of single-precisions 




















Table 17-2. Vector Types 


For alignment restrictions of all types see the section entitled “Aligned and 
Misaligned Accesses” on page 20. 


17.1.2 Aggregates and Unions 


Aggregates (structures and arrays) and unions assume the alignment of their most 
strictly aligned element. 


v Anarray uses the alignment of its elements. 


yY Structures and unions can require padding to meet alignment restrictions. Each 
element is assigned to the lowest aligned address. 


struct { 
char C; C 
ti 


Figure 17-1. Byte aligned, sizeof is 1 
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struct { 
char C; 
char D; 
short §S; 
long N; 
3; 
Figure 17-2. No padding, sizeof is 8 
Cc Pad 
struct { 
char C; D 
double D; 
short S; D 
} Ss Pad 
Figure 17-3. Padding, sizeof is 16 
17.1.3 Bit-fields 


C structure and union definitions can have elements defined by a specified number 


of bits. Table 17-3 describes valid bit-field types and their ranges. 











Bit-fields follow the same alignment rules as aggregates and unions, with the 


following additions: 
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Bit-field Type Width w [bits] Range 
signed char -2"7 to 24-1 
char 1to8 0 to 2”-1 
unsigned char 0 to 2”-1 
signed short -2"7 to 21-1 
short 1 to 16 0 to 2"-1 
unsigned short 0 to 2”-1 
signed int -2”1 to 21-1 
int 0 to 2”-1 
enum Oto 2”-1 
unsigned int 1 to 32 0 to 2”-1 
signed long -2"1 to 2" 4-1 
long 0 to 2”-1 
unsigned long 0 to 2”-1 


Table 17-3. Bit-Field Types and Ranges 
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v Bit-fields are allocated from most to least significant (from left to right) 
A bit-field must entirely reside in a storage unit appropriate for its declared type. 


v Bit-fields may share a storage unit with other struct/union elements, including 
elements that are not bit-fields. Struct elements occupy different parts of the storage 
unit. 


v¥ Unnamed bit-fields’ types do not affect the alignment of a structure or union 


AN 




















struct { 
short $:9; Pad 
s(9 J(Q C(8 
Sak 3:9: (9) (9) (6) (8) 
: Pad Pad 
char C; T) | @ | YO | a 
short T:9; 
short  U:9; D(8) Pad (24) 
char Be 
}; 


Figure 17-4. Storage unit sharing and alignment padding, sizeof is 12 


17.2Function Calling Sequence 


This section describes the standard function calling sequence, including stack 
frame layout, register usage, parameter passing, and so on. The standard calling sequence 
requirements apply only to global functions, however it is recommended that all 
functions use the standard calling sequence. 


17.2.1 Register Usage 

The OpenRISC 1000 architecture defines 32 general-purpose registers. These 
registers are 32 bits wide in 32-bit implementations and 64 bits wide in 64-bit 
implementations. 

















Register Preserved across function calls Usage 
R31 No Temporary register 
R30 Yes Callee-saved register 
R29 No Temporary register 
R28 Yes Callee-saved register 
R27 No Temporary register 
R26 Yes Callee-saved register 
R25 No Temporary register 
R24 Yes Callee-saved register 
R23 No Temporary register 
R22 Yes Callee-saved register 
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Register Preserved across function calls Usage 

R21 No Temporary register 

R20 Yes Callee-saved register 

R19 No Temporary register 

R18 Yes Callee-saved register 

R17 No Temporary register 

R16 Yes Callee-saved register 

R15 No Temporary register 

R14 Yes Callee-saved register 

R13 No Temporary register 

R12 No Temporary register for 64-bit 

RVH - Return value upper 32 bits of 

64-bit value on 32-bit system 

R11 No RV — Return value 

R10 Yes Thread Local Storage 

R9 Yes LR — Link address register 

R8 No Function parameter word 5 

R7 No Function parameter word 4 

R6 No Function parameter word 3 

R5 No Function parameter word 2 

R4 No Function parameter word 1 

R3 No Function parameter word 0 

R2 Yes FP - Frame pointer (optional) 

R1 Yes SP - Stack pointer 

RO - Fixed to zero 














Table 17-4. General-Purpose Registers 


Some registers have assigned roles: 


RO [Zero] 
R1 [SP] 


R2 [FP] 


R3 through R8 
R9 [LR] 


R10 [TLS] 


www.openrisc.io 





Holds a zero value. 


The stack pointer holds the limit of the current stack frame. The first 128 
bytes below the stack pointer are reserved for leaf functions, and below that 
are undefined. Stack pointer must be word aligned at all times. 


The frame pointer holds the address of the previous stack frame. Incoming 
function parameters reside in the previous stack frame and can be accessed 
at positive offsets from FP. The compiler may use this register for other 
puposes if instructed. 


General-purpose parameters use up to 6 general-purpose registers. 
Parameters beyond the sixth word appear on the stack. 


Link address is the location of the function call instruction and is used to 
calculate where program execution should return after function completion. 


Thread Local Storage host the address of this context’s thread local 
storage structure. This mechanism, as normally provided by the compiler, 
allows designated variables to have one instance per thread. 
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R11 [RV] Return value of the function. For void functions a value is not defined. For 
functions returning a union or structure, a pointer to the result is placed into 
return value register. 


R12 [RVH] Return value high of the function. For functions returning 32-bit values this 
register can be considered temporary register. Note that this holds the less 
significant bits on big-endian implementations; 32-bit values still go in RV. 


On big-endian implementations, R11 is used for the high 32 bits of 64-bit return 
values and R12 is used for the low 32 bits. On little-endian implementations this is 
reversed. This matches register order with memory storage. 

Furthermore, an OpenRISC 1000 implementation might have several sets of 


shadowed general-purpose registers. These shadowed registers are used for fast context 
switching and sets can be switched only by the operating system. 


17.2.2 The Stack Frame 


In addition to registers, each function has a frame on the run-time stack. This stack 
grows downward from high addresses. Table 17-5 shows the stack frame organization. 














Position Contents Frame 
FP + 4N Parameter N 
wi eh Previous 
FP +0 First stack parameter 
FP—4 Return address 
FPS6 Previous FP value 
FP —12 Function variables Curren 
SP +0 Subfunction call parameters 
SP-4 For use by leaf functions w/o function 
SP — 128 prologue/epilogue 
eee Future 
SP — 2536 For use by exception handlers 

















Table 17-5. Stack Frame 

When no compiler optimization is in place, the stack pointer always points to the 
end of the latest allocated stack frame. However when optimization is in effect the stack 
pointer may not be updated, so that up to 128 bytes beyond the current stack pointer are 
in use. 

Optimized code will in general not use the frame pointer, freeing it up for use as 
another temporary register. 

All frames must be word aligned. 

The first 128 bytes below the current stack frame are reserved for use by optimized 
code. Exception handlers must guarantee that they will not use this area. 
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17.2.3 Parameter Passing 


Functions receive up to their first 6 arguments in general-purpose parameter 
registers. No register holds more than one argument, and 64-bit arguments use two 
adjacent words. If there are more than six words, the remaining arguments are passed on 
the stack. Structure and union arguments are passed as pointers. 


All 64-bit arguments in a 32-bit system are passed using a pair of words when 
available, in the same way as for other arguments. 64-bit arguments are not aligned. For 
example long long arg1, long arg2, long long arg3 are passed in the following way: arg1 
in r3&r4, arg2 inr5, arg3 in r6&r7. 


On big-endian implementations the high 32 bits are passed in the lower numbered 
register of the pair. On little-endian implementations this is reversed. 


Individual arguments are not split across registers and stack, and variadic 
arguments are always put on the stack. For example, printf(char *fmt, ...) only takes one 
register argument, fmt. 


For C++, the first argument word is the this pointer. 


17.2.4 Functions Returning Scalars or No Value 


A function that returns an integral, pointer or vector/floating-point value places its 
result in the general-purpose RV register. void functions put no particular value in 
GPR[RV] register. 


64-bit return values also use the RVH register, which is otherwise undefined and 
not preserved across function calls. 


17.2.5 Functions Returning Structures or Unions 


A function that returns a structure or union places the address of the structure or 
union in the general-purpose RV register. 


A function that returns a structure by value expects the location where that 
structure is to be placed to be supplied in function parameter word 0 (R3). 


17.30perating System Interface 


17.3.1 Exception Interface 


The OpenRISC 1000 exception mechanism allows the processor to change to 
supervisor mode as a result of external signals, errors or execution of certain instructions. 
When an exception occurs the following events happen: 

v The address of the interrupted instruction, supervisor register and EA (when relevant) 
are saved into EPCR, ESR and EEAR registers 

v¥ The machine mode is changed to supervisor mode as per section 6.3, Exception 
Processing. This includes disabling MMUs and exceptions. 

v The execution resumes from a predefined exception vector address which is different 
for every exception 
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Exception Type | Vector Offset[11:0] SIGNAL Example 
Reset 0x100 None Reset 
Bus Error 0x200 SIGBUS Unexisting physical location, bus parity 
error. 
Data Page Fault 0x300 SIGSEGV | Unmapped data location or protection 
violation. 
Instruction Page 0x400 SIGSEGV Unmapped instruction location or 
Fault protection violation 
Tick Timer 0x500 None Process scheduling 
Interrupt 
Alignment 0x600 SIGBUS Unaligned data 
Illegal Instruction 0x700 SIGILL Illegal/unimplemented instruction 
External Interrupt 0x800 None Device has asserted an interrupt 
D-TLB Miss 0x900 None DTLB software reload needed 
I-TLB Miss OxA00 None ITLB software reload needed 
Range OxBOO SIGSEGV Arithmetic overflow 
System Call OxC00 None Instruction I.sys 
Trap OxE0O SIGTRAP Instruction I.trap or debug unit 
exception. 








Table 17-6. Hardware Exceptions and Signals 


The significant bits (31-12) of the vector offset address for each exception depend 
on the setting of the Supervision Register (SR)'s EPH bit and presence and setting of of 
the Exception Vector Base Address Register (EVBAR), which can specify an offset. For 
example, in the absence of the EVBAR and with SR[EPH] clear, the offset is zero. 


The operating system handles an exception either by completing the faulting 
exception in a manner transparent to the application, if possible, or by delivering a signal 
to the application. Table 17-6 shows how hardware exceptions can be mapped to signals 
if the operating system cannot complete the faulting exception. 


17.3.2 Virtual Address Space 


For user programs to execute in virtual address space, the memory management 
unit (MMU) must be enabled. The MMU translates virtual address generated by the 
running process into physical address. This allows the process to run anywhere in the 
physical memory and additionally page to a secondary storage. 


Processes typically begin with three logical segments, commonly referred as 
“text”, “data” and “stack”. Additional segments may exist or can be created by the 
operating system. 


17.3.3 Page Size 


Memory is organized into pages, which are the system’s smallest units of memory 
allocation. The basic page size is 8KB with some implementations supporting 16MB and 
32GB pages. 
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17.3.4 Virtual Address Assignments 


Processes have full access to the entire virtual address space. However the size of a 
process can be limited by several factors such as a process size limit parameter, available 
physical memory and secondary storage. 





OxFFFF_FFFF Reserved system area 





Start of Stack Stack 
Growing Down 

















Growing Up Heap 

.bss 

Start of Data Segments data 
Start of Program Code .text 


Shared Objects 
Start of Dynamic Segment Area 


0x0000_ 2000 





Unmapped 








0x0000_0000 


Table 17-7. Virtual Address Configuration 
Page at location 0x0 is usually reserved to catch dereferences of NULL pointers. 


Usually the beginning address of “.text”, “.data” and “.bss” segments are defined 
when linking the executable file. The heap is adjusted with facilities such as malloc and 
free. The dynamic segment area is adjusted with mmap, and the stack size is limited with 
setrlimit. 


17.3.5 Stack 


Every process has its own stack that is not tied to a fixed area in its address space. 
Since the stack can change differently for each call of a process, a process should use the 
stack pointer in general-purpose register r1 to access stack data. 


17.3.6 Processor Execution Modes 


The OpenRISC 1000 provides two execution modes: user and supervisor. 
Processes run in user mode and the operating system’s kernel runs in supervisor mode. A 
Process must execute the /.sys instruction to switch to supervisor mode, hence requesting 
service from the operating system. It is suggested that system calls use the same argument 
passing model as used with function calls, except additional register r11 specifies system 
call id. 
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17.4Position-Independent Code 


This section needs to be written. Position-independent code is desired for proper 
dynamic linking support, which remains to be implemented. 


17.5ELF 


The OpenRISC tools use the ELF object file formats and DWARF debugging 
information formats, as described in System V Application Binary Interface, from the 
Santa Cruz Operation, Inc. ELF and DWARF provide a suitable basis for representing the 
information needed for embedded applications. Other object file formats are available, 
such as COFF. This section describes particular fields in the ELF and DWARF formats 
that differ from the base standards for those formats. 


17.5.1 Header Convention 


The e_machine member of the ELF header contains the decimal value 33906 
(hexadecimal 0x8472) that is defined as the name EM_OR32. 


The e_ident member of the ELF header contains values as shown in Table 17-8. 
OR32 ELF e_ident Fields 





e_ident[El CLASS] ELFCLASS32 For all 32-bit implementations 


e_ident[El DATA] ELFDATA2MSB For all implementations 





Table 17-8. e_ident Field Values 


The e_flags member of the ELF header contains values as shown in Table 17-9. 
OR32 ELF e flags 


HAS RELOC Ox01 Contains relocation entries 
EXEC_P 0x02 Is directly executable 


HAS _LINENO 0x04 Has line number information 
HAS DEBUG 0x08 Has debugging information 
HAS_SYMS 0x10 Has symbols 

HAS _ LOCALS 0x20 Has local symbols 


DYNAMIC 0x40 Is dynamic object 
WP_TEXT 0x80 Text section is write protected 


D_PAGED 0x100 Is dynamically paged 


























Table 17-9. e_flags Field Values 


17.5.2 Sections 


There are no OpenRISC section requirements beyond the base ELF standards. 
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17.5.3 Relocation 


This section describes values and algorithms used for relocations. In particular, it 
describes values the compiler/assembler must leave in place and how the linker modifies 
those values. 





























Name Value Size Calculation 
R_OR32_NONE 0 0 None 
R_OR32_32 1 32 A 
R_OR32_16 2 16 A & OXxffff 
R_OR32_8 3 8 A & Oxff 
R_OR32_CONST 4 16 A & Oxffff 
R_OR32_CONSTH 5 16 (A>> 16) & Oxffff 
R_OR32_JUMPTARG 6 28 (S+A-P)>>2 


Key S indicates the final value assigned to the symbol refernced in the relocation 
record. Key A is the added value specified in the relocation record. Key P indicates the 
address of the relocation (e.g., the address being modified). 
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Instruction Mnemonic Function Page 
OOOOOONNNNNNNNNNNNNNNNNNNNNNNNNN lj Jump 
QOOOOLNNNNNNNNNNNNNNNNNNNNNNNNNN I.jal Jump and Link 
QOOOLTLNNNNNNNNNNNNNNNNNNNNNNNNNN |.onf Branch if No Flag 
QOOLOONNNNNNNNNNNNNNNNNNNNNNNNNN |.bf Branch if Flag 
00010101-------- KKKKKKKKKKKKKKKK I.nop No Operation 
000110DDDDD- - - -OKKKKKKKKKKKKKKKK |.movhi Move Immediate High 
000110DDDDD - - - -10000000000000000 |.macrc MAC Read and Clear 
0010000000000000KKKKKKKKKKKKKKKK I.sys System Call 
0010000100000000KKKKKKKKKKKKKKKK l.trap Trap 
00100010000000000000000000000000 I.msync Memory Synchronization 
00100010100000000000000000000000 l.psync Pipeline Synchronization 
00100011000000000000000000000000 l.csync Context Synchronization 
001001-------------------------- l.rfe Return From Exception 
001010------------------ 1100---- lv.cust1 Reserved for Custom Vector 

Instructions 
001010------------------ 1101---- lv.cust2 Reserved for Custom Vector 
Instructions 
001010------------------ 1110---- lv.cust3 Reserved for Custom Vector 
Instructions 
001010------------------ 1111---- lv.cust4 Reserved for Custom Vector 
Instructions 
00101 0DDDDDAAAAABBBBB- - -00010000 | _lv.all_eq.b Vector Byte Elements All 
Equal 
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Instruction Mnemonic Function Page 
00101 0DDDDDAAAAABBBBB - - -00010001 lv.all_eg.h Vector Half-Word Elements 
All Equal 
00101 0DDDDDAAAAABBBBB- - -00010010 | lv.all_ge.b Vector Byte Elements All 
Greater Than or Equal To 
00101 0DDDDDAAAAABBBBB - - -00010011 lv.all_ge.h Vector Half-Word Elements 
All Greater Than or Equal To 
00101 0DDDDDAAAAABBBBB- - - 0010100 lv.all_gt.b Vector Byte Elements All 
Greater Than 
00101 0DDDDDAAAAABBBBB - - -00010101 lv.all_gt.h Vector Half-Word Elements 
All Greater Than 
00101 0DDDDDAAAAABBBBB- - - 00010110 lv.all_le.b Vector Byte Elements All 
Less Than or Equal To 
001010DDDDDAAAAABBBBB- - -00010111 lv.all_le.h Vector Half-Word Elements 
All Less Than or Equal To 
00101 0DDDDDAAAAABBBBB- - - 00011000 lv.all_It.b Vector Byte Elements All 
Less Than 
00101 0DDDDDAAAAABBBBB- - -00011001 Iv.all_It.h Vector Half-Word Elements 
All Less Than 
001010DDDDDAAAAABBBBB- - -00011010 | lv.all_ne.b Vector Byte Elements All 
Not Equal 
001010DDDDDAAAAABBBBB- - - 00011011 lv.all_ne.h Vector Half-Word Elements 
All Not Equal 
00101 0DDDDDAAAAABBBBB - - - 00100000 lv.any_eq.b Vector Byte Elements Any 
Equal 
00101 0DDDDDAAAAABBBBB- - -@0100001 | Iv.any_eg.h Vector Half-Word Elements 
Any Equal 
00101 0DDDDDAAAAABBBBB - - - 00100010 lv.any_ge.b Vector Byte Elements Any 
Greater Than or Equal To 
00101 0DDDDDAAAAABBBBB- - -@0100011 | Iv.any_ge.h Vector Half-Word Elements 
Any Greater Than or Equal 
To 
00101 0DDDDDAAAAABBBBB- - -@0100100 | Iv.any_gt.b Vector Byte Elements Any 
Greater Than 
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Instruction Mnemonic Function Page 
00101 0DDDDDAAAAABBBBB- - -@0100101 | Iv.any_gt.h Vector Half-Word Elements 
Any Greater Than 
00101 0DDDDDAAAAABBBBB - - - 00100110 lv.any_le.b Vector Byte Elements Any 
Less Than or Equal To 
00101 0DDDDDAAAAABBBBB.- - -00100111 lv.any_le.h Vector Half-Word Elements 
Any Less Than or Equal To 
00101 0DDDDDAAAAABBBBB - - - 00101000 lv.any_It.b Vector Byte Elements Any 
Less Than 
001010DDDDDAAAAABBBBB- - -00101001 Iv.any_lt.h Vector Half-Word Elements 
Any Less Than 
00101 0DDDDDAAAAABBBBB - - - 00101010 lv.any_ne.b Vector Byte Elements Any 
Not Equal 
001010DDDDDAAAAABBBBB- - -00101011 | Iv.any_ne.h | Vector Half-Word Elements 
Any Not Equal 
00101 0DDDDDAAAAABBBBB- - - 00110000 lv.add.b Vector Byte Elements Add 
Signed 
001010DDDDDAAAAABBBBB- - - 00110001 lv.add.h Vector Half-Word Elements 
Add Signed 
00101 0DDDDDAAAAABBBBB - - - 00110010 lv.adds.b Vector Byte Elements Add 
Signed Saturated 
001010DDDDDAAAAABBBBB- - - 00110011 Iv.adds.h Vector Half-Word Elements 
Add Signed Saturated 
00101 0DDDDDAAAAABBBBB - - - 00110100 lv.addu.b Vector Byte Elements Add 
Unsigned 
00101 0DDDDDAAAAABBBBB.- - -00110101 Ilv.addu.h Vector Half-Word Elements 
Add Unsigned 
00101 0DDDDDAAAAABBBBB - - -00110110 lv.addus.b Vector Byte Elements Add 
Unsigned Saturated 
00101 0DDDDDAAAAABBBBB.- - -00110111 lv.addus.h Vector Half-Word Elements 
Add Unsigned Saturated 
00101 0DDDDDAAAAABBBBB - - -00111000 lv.and Vector And 
00101 0DDDDDAAAAABBBBB.- - -00111001 lv.avg.b Vector Byte Elements 
Average 
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Instruction Mnemonic Function Page 
00101 0DDDDDAAAAABBBBB - - - 00111010 lv.avg.h Vector Half-Word Elements 
Average 
001010DDDDDAAAAABBBBB - - -01000000 | Iv.cmp_eq.b Vector Byte Elements 
Compare Equal 
00101 0DDDDDAAAAABBBBB- - -01000001 | lv.cmp_eg.h | Vector Half-Word Elements 
Compare Equal 
001010DDDDDAAAAABBBBB - - -01000010 | lv.cmp_ge.b Vector Byte Elements 
Compare Greater Than or 
Equal To 
001010DDDDDAAAAABBBBB- - -01000011 | Iv.cmp_ge.h | Vector Half-Word Elements 
Compare Greater Than or 
Equal To 
00101 0DDDDDAAAAABBBBB - - - 01000100 lv.cmp_gt.b Vector Byte Elements 
Compare Greater Than 
001010DDDDDAAAAABBBBB- - -01000101 | lv.cmp_gt.h | Vector Half-Word Elements 
Compare Greater Than 
00101 0DDDDDAAAAABBBBB - - -01000110 lv.cmp_le.b Vector Byte Elements 
Compare Less Than or 
Equal To 
001010DDDDDAAAAABBBBB- - -01000111 | Iv.cmp_le.h | Vector Half-Word Elements 
Compare Less Than or 
Equal To 
00101 0DDDDDAAAAABBBBB - - - 01001000 lv.cmp_It.b Vector Byte Elements 
Compare Less Than 
001010DDDDDAAAAABBBBB- - -01001001 | lv.cmp_lt.h Vector Half-Word Elements 
Compare Less Than 
001010DDDDDAAAAABBBBB- - -01001010 | Iv.cmp_ne.b Vector Byte Elements 
Compare Not Equal 
001010DDDDDAAAAABBBBB- - -01001011 | Iv.cmp_ne.h | Vector Half-Word Elements 
Compare Not Equal 
001010DDDDDAAAAABBBBB- - - 01010100 lv.madds.h Vector Half-Word Elements 
Multiply Add Signed 
Saturated 
00101 0DDDDDAAAAABBBBB - - -01010101 lv.max.b Vector Byte Elements 
Maximum 
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Instruction Mnemonic Function Page 
00101 0DDDDDAAAAABBBBB - - - 01010110 lv.max.h Vector Half-Word Elements 
Maximum 
001010DDDDDAAAAABBBBB- --01010111 | Iv.merge.b | Vector Byte Elements Merge 
001010DDDDDAAAAABBBBB- - -01011000 | _ Iv.merge.h Vector Half-Word Elements 
Merge 
00101 0DDDDDAAAAABBBBB - - -01011001 lv.min.b Vector Byte Elements 
Minimum 
00101 0DDDDDAAAAABBBBB- - -01011010 lv.min.h Vector Half-Word Elements 
Minimum 
001010DDDDDAAAAABBBBB- - -01011011 | lv.msubs.h Vector Half-Word Elements 
Multiply Subtract Signed 
Saturated 
00101 0DDDDDAAAAABBBBB- - - 01011100 lv.muls.h Vector Half-Word Elements 
Multiply Signed Saturated 
00101 0DDDDDAAAAABBBBB.- - -01011101 lv.nand Vector Not And 
00101 0DDDDDAAAAABBBBB - - - 01011110 lv.nor Vector Not Or 
00101 0DDDDDAAAAABBBBB.- - -01011111 lv.or Vector Or 
001010DDDDDAAAAABBBBB- - -01100000 lv.pack.b Vector Byte Elements Pack 
001010DDDDDAAAAABBBBB- - -01100001 lv.pack.h Vector Half-word Elements 
Pack 
001010DDDDDAAAAABBBBB.- - -01100010 lv.packs.b Vector Byte Elements Pack 
Signed Saturated 
001010DDDDDAAAAABBBBB- - -01100011 lv.packs.h Vector Half-word Elements 
Pack Signed Saturated 
001010DDDDDAAAAABBBBB- - -01100100 | Iv.packus.b | Vector Byte Elements Pack 
Unsigned Saturated 
00101 0DDDDDAAAAABBBBB- - -01100101 | Iv.packus.h Vector Half-word Elements 
Pack Unsigned Saturated 
00101 0DDDDDAAAAABBBBB - - - 01100110 lv.perm.n Vector Nibble Elements 
Permute 
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Instruction Mnemonic Function Page 
00101 0DDDDDAAAAABBBBB- - -01100111 lv.rl.b Vector Byte Elements 
Rotate Left 
001010DDDDDAAAAABBBBB- - -01101000 Iv.rl.A Vector Half-Word Elements 
Rotate Left 
00101 0DDDDDAAAAABBBBB- - - 01101001 lv.sll.b Vector Byte Elements Shift 
Left Logical 
001010DDDDDAAAAABBBBB- - - 01101010 lv.sll.h Vector Half-Word Elements 
Shift Left Logical 
00101 0DDDDDAAAAABBBBB- - -01101011 lv.sll Vector Shift Left Logical 
00101 0DDDDDAAAAABBBBB- - - 01101100 lv.srl.b Vector Byte Elements Shift 
Right Logical 
00101 0DDDDDAAAAABBBBB.- - -01101101 lv.srl.h Vector Half-Word Elements 
Shift Right Logical 
00101 0DDDDDAAAAABBBBB - - - 01101110 lv.sra.b Vector Byte Elements Shift 
Right Arithmetic 
001010DDDDDAAAAABBBBB- - -01101111 lv.sra.h Vector Half-Word Elements 
Shift Right Arithmetic 
001.01 0DDDDDAAAAABBBBB- - - 01110000 Iv.srl Vector Shift Right Logical 
00101 0DDDDDAAAAABBBBB.- - -01110001 lv.sub.b Vector Byte Elements 
Subtract Signed 
001010DDDDDAAAAABBBBB- - - 01110010 lv.sub.h Vector Half-Word Elements 
Subtract Signed 
00101 0DDDDDAAAAABBBBB.- - -01110011 lv.subs.b Vector Byte Elements 
Subtract Signed Saturated 
001010DDDDDAAAAABBBBB- - - 01110100 lv.subs.h Vector Half-Word Elements 
Subtract Signed Saturated 
00101 0DDDDDAAAAABBBBB- - -01110101 lv.subu.b Vector Byte Elements 
Subtract Unsigned 
001010DDDDDAAAAABBBBB- - - 01110110 Iv.subu.h Vector Half-Word Elements 
Subtract Unsigned 
00101 0DDDDDAAAAABBBBB- - - 01110111 lv.subus.b Vector Byte Elements 
Subtract Unsigned 
Saturated 
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Instruction Mnemonic Function Page 
001010DDDDDAAAAABBBBB.- - - 01111000 Ilv.subus.h Vector Half-Word Elements 
Subtract Unsigned 
Saturated 
001010DDDDDAAAAABBBBB- - -01111001 | Iv.unpack.b Vector Byte Elements 
Unpack 
001010DDDDDAAAAABBBBB- - -01111010 | Iv.unpack.h | Vector Half-Word Elements 
Unpack 
001010DDDDDAAAAABBBBB- - -01111011 lv.xor Vector Exclusive Or 
010001---------- BBBBB- - - - - ------ |.jr Jump Register 
010010---------- BBBBB- ---------- I.jalr Jump and Link Register 
010011-- --- AAAAATIIIIIIIIIIIIIII I.maci Multiply Immediate Signed 
and Accumulate 
011011DDDDDAAAAATITIIIIIIIIIIIII l.lwa Load Single Word Atomic 
011101-------------------------- l.cust2 Reserved for ORBIS32/64 
Custom Instructions 
011110-------------------------- I.cust3 Reserved for ORBIS32/64 
Custom Instructions 
011111-------------------------- |.cust4 Reserved for ORBIS32/64 
Custom Instructions 
100000DDDDDAAAAATIIIIIIIIIIIIIII l.ld Load Double Word 
100001DDDDDAAAAATITIIIIIIIIIIIII l.lwz Load Single Word and 
Extend with Zero 
100010DDDDDAAAAATITIIIIIIIIIIIII l.lws Load Single Word and 
Extend with Sign 
100011DDDDDAAAAAIIIIIIIIIIIIIIII I.lbz Load Byte and Extend with 
Zero 
100100DDDDDAAAAATITIIIIIIIIIIIII l.lbs Load Byte and Extend with 
Sign 
100101DDDDDAAAAATITIIIIIIIIIIIII I.lhz Load Half Word and Extend 
with Zero 
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Instruction Mnemonic Function Page 
10011 0DDDDDAAAAATTIIIIIIIIIIIIIII l.lhs Load Half Word and Extend 
with Sign 
100111DDDDDAAAAATIIIIIIIIITIITIIII |.addi Add Immediate Signed 
101000DDDDDAAAAATTIIIIIIIIIIITIIII |.addic Add Immediate Signed and 
Carry 
101001DDDDDAAAAAKKKKKKKKKKKKKKKK |.andi And with Immediate Half 
Word 
10101 0DDDDDAAAAAKKKKKKKKKKKKKKKK |.ori Or with Immediate Half 
Word 
101011DDDDDAAAAATIIITIIIIIIIIIII |.xori Exclusive Or with Immediate 
Half Word 
1011 0@DDDDDAAAAATITIITIIIIIIIIIT |.muli Multiply Immediate Signed 
101101DDDDDAAAAAKKKKKKKKKKKKKKKK I.mfspr Move From Special-Purpose 
Register 
10111 0DDDDDAAAAA- - - - - - - - @OLLLLLL L.slli Shift Left Logical with 
Immediate 
10111 0DDDDDAAAAA- - - - - - - - @41LLLLLL I.srli Shift Right Logical with 
Immediate 
10111 ODDDDDAAAAA- - - - - - - - AOLLLLLL l.srai Shift Right Arithmetic with 
Immediate 
10111 0DDDDDAAAAA- - - - - - - - 14LLLLLL l.rori Rotate Right with Immediate 
101111 00000AAAAATITIIIIIIIIIIIII I.sfeqi Set Flag if Equal Immediate 
10111100001AAAAATIIIIIIIIIIIIIII |.sfnei Set Flag if Not Equal 
Immediate 
10111100010AAAAATIIIIIIIIIIIIIII |.sfgtui Set Flag if Greater Than 
Immediate Unsigned 
10111100011AAAAATIIITIIIIIIIIIII |.sfgeui Set Flag if Greater or Equal 
Than Immediate Unsigned 
10111100100AAAAATIIITIIIIIIIIIII |.sfltui Set Flag if Less Than 
Immediate Unsigned 
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10111100101AAAAATIIIITIIIIIIIIIII |.sfleui Set Flag if Less or Equal 
Than Immediate Unsigned 
10111101010AAAAATIIIIIIIIIIIIIII |.sfgtsi Set Flag if Greater Than 
Immediate Signed 
10111101011AAAAATIIIITIIIIIIITIIII |.sfgesi Set Flag if Greater or Equal 
Than Immediate Signed 
10111101100AAAAATIIITIIIIIIIIIII |.sfltsi Set Flag if Less Than 
Immediate Signed 
10111101101AAAAATIIIITIIIIIIIIIII |.sflesi Set Flag if Less or Equal 
Than Immediate Signed 
110000KKKKKAAAAABBBBBKKKKKKKKKKK I.mtspr Move To Special-Purpose 
Register 
110001 - - --- AAAAABBBBB- - - - - - - 0001 I.mac Multiply Signed and 
Accumulate 
110001----- AAAAABBBBB- - - - - - - 0011 I.macu Multiply Unsigned and 
Accumulate 
110001 - - --- AAAAABBBBB- - - - - - - 0010 I.msb Multiply Signed and 
Subtract 
110001- - --- AAAAABBBBB- - - - - - - 0100 I.msbu Multiply Unsigned and 
Subtract 
110010- ---- AAAAABBBBB- - -00001000 If.sfeq.s Set Flag if Equal Floating- 
Point Single-Precision 
110010- ---- AAAAABBBBB.- - -00001001 lf.sfne.s Set Flag if Not Equal 
Floating-Point Single- 
Precision 
110010- ---- AAAAABBBBB.- - -00001010 If.sfgt.s Set Flag if Greater Than 
Floating-Point Single- 
Precision 
110010- - -- - AAAAABBBBB- - -00001011 If.sfge.s Set Flag if Greater or Equal 
Than Floating-Point Single- 
Precision 
110010- ---- AAAAABBBBB.- - -00001100 If.sflt.s Set Flag if Less Than 
Floating-Point Single- 
Precision 
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110010- ---- AAAAABBBBB- - -Q0001101 If.sfle.s Set Flag if Less or Equal 
Than Floating-Point Single- 
Precision 
110010- - --- AAAAABBBBB- - -00011000 If.sfeq.d Set Flag if Equal Floating- 


Point Double-Precision 





110010- ---- AAAAABBBBB.- - -00011001 lf.sfne.d Set Flag if Not Equal 
Floating-Point Double- 
Precision 





110010- ---- AAAAABBBBB.- - -00011010 If.sfgt.d Set Flag if Greater Than 
Floating-Point Double- 
Precision 





110010- - --- AAAAABBBBB- - -00011011 If.sfge.d Set Flag if Greater or Equal 
Than Floating-Point Double- 
Precision 





110010- ---- AAAAABBBBB.- - -00011100 If.sflt.d Set Flag if Less Than 
Floating-Point Double- 
Precision 





110010- ---- AAAAABBBBB.- - -00011101 lf.sfle.d Set Flag if Less or Equal 
Than Floating-Point Double- 
Precision 





110010----- AAAAABBBBB-- - -1101- - - - If.custL.s Reserved for ORFPX32 
Custom Instructions 


110010----- AAAAABBBBB- - -1110- - - - If.custL.d Reserved for ORFPX64 
Custom Instructions 





11001 0DDDDDAAAAA00000 - - -00000100 If.itof.s Integer To Floating-Point 
Single-Precision 





11001 0DDDDDAAAAA00000 - - -00000101 If.ftoi.s Floating-Point Single- 
Precision To Integer 


110010DDDDDAAAAAO00000 - - -00010100 If.itof.d Integer To Floating-Point 
Double-Precision 





11001 0DDDDDAAAAA00000 - - -00010101 If. ftoi.d Floating-Point Double- 
Precision To Integer 


11001 0DDDDDAAAAABBBBB- - - 00000000 lf.add.s Add Floating-Point Single- 
Precision 
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11001 0DDDDDAAAAABBBBB - - -00000001 If.sub.s Subtract Floating-Point 
Single-Precision 
11001 0DDDDDAAAAABBBBB - - - 00000010 lf.mul.s Multiply Floating-Point 
Single-Precision 
11001 0DDDDDAAAAABBBBB.- - -00000011 If.div.s Divide Floating-Point Single- 
Precision 
110010DDDDDAAAAABBBBB- - - 00000110 If.rem.s Remainder Floating-Point 
Single-Precision 
11001 0DDDDDAAAAABBBBB- - - 00000111 If.madd.s Multiply and Add Floating- 
Point Single-Precision 
11001 0DDDDDAAAAABBBBB - - -00010000 lf.add.d Add Floating-Point Double- 
Precision 
11001 0DDDDDAAAAABBBBB - - -00010001 If.sub.d Subtract Floating-Point 
Double-Precision 
11001 0DDDDDAAAAABBBBB- - - 00010010 If.mul.d Multiply Floating-Point 
Double-Precision 
11001 0DDDDDAAAAABBBBB - - -00010011 If.div.d Divide Floating-Point 
Double-Precision 
11001 0DDDDDAAAAABBBBB - - -00010110 If.rem.d Remainder Floating-Point 
Double-Precision 
11001 0DDDDDAAAAABBBBB- - -©0010111 If. madd.d Multiply and Add Floating- 
Point Double-Precision 
110011 TITITAAAAABBBBBIITIIIIIIIII |.swa Store Single Word Atomic 
110101TITITAAAAABBBBBIIIIIIIIIII L.Sw Store Single Word 
110110ITITITAAAAABBBBBITIIIIIIIII I.sb Store Byte 
110111 TITITAAAAABBBBBIIIIIIIIIII I.sh Store Half Word 
111000DDDDDAAAAA - - - - - - 0000 - -1100 l.exths Extend Half Word with Sign 
111000DDDDDAAAAA - - - - - - 0000- -1101 l.extws Extend Word with Sign 
111 0OODDDDDAAAAA- - - - - - Q001--1100 l.extbs Extend Byte with Sign 
111 0O00DDDDDAAAAA - - - - - - 0001--1101 l.extwz Extend Word with Zero 
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111000DDDDDAAAAA - - - - - - 0010 - -1100 l.exthz Extend Half Word with Zero 
111000DDDDDAAAAA - - - - - - 0011 - -1100 l.extbz Extend Byte with Zero 
111000DDDDDAAAAABBBBB - 00 - - - -0000 |.add Add Signed 
111000DDDDDAAAAABBBBB - 00 - - - -0001 l.addc Add Signed and Carry 
111000DDDDDAAAAABBBBB - 00 - - - -0010 |.sub Subtract Signed 
111000DDDDDAAAAABBBBB - 00 - - - -0011 l.and And 
111000DDDDDAAAAABBBBB - 00 - - - -0100 l.or Or 
111000DDDDDAAAAABBBBB - 00 - - - -0101 I.xor Exclusive Or 
111000DDDDDAAAAABBBBB - 00 - - - - 1110 l.cmov Conditional Move 
111 000DDDDDAAAAA - - - - - - 0O----1111 |. ff. Find First 1 
111000DDDDDAAAAABBBBB - 0000 - - 1000 I.sll Shift Left Logical 
111000DDDDDAAAAABBBBB - 0001 - - 1000 I.srl Shift Right Logical 
111000DDDDDAAAAABBBBB - 0010 - - 1000 l.sra Shift Right Arithmetic 
111000DDDDDAAAAABBBBB - 0011 - - 1000 l.ror Rotate Right 
111000DDDDDAAAAA - - - - - - @1----1111 1.f1. Find Last 1 
111000DDDDDAAAAABBBBB - 11 - - - -0110 I.mul Multiply Signed 
111000-- -- - AAAAABBBBB- 11- - - -0111 I.muld Multiply Signed to Double 
111000DDDDDAAAAABBBBB- 11 - - - - 1001 l.div Divide Signed 
111000DDDDDAAAAABBBBB - 11 - - - - 1010 l.divu Divide Unsigned 
111 @O@DDDDDAAAAABBBBB- 11 - - - -1011 |.mulu Multiply Unsigned 
111000-- -- - AAAAABBBBB - 11- - - -1100 I.muldu Multiply Unsigned to Double 
11100100000AAAAABBBBB- - - - - - - - - - - l.sfeq Set Flag if Equal 
11100100001AAAAABBBBB- - - - - - ----- |.sfne Set Flag if Not Equal 
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1110010001 0AAAAABBBBB- - - - - - - -- - - |.sfgtu Set Flag if Greater Than 
Unsigned 
11100100011 AAAAABBBBB- - - - - - - - - - - l.sfgeu Set Flag if Greater or Equal 
Than Unsigned 
11100100100AAAAABBBBB- - - - - - - - - -- |.sfltu Set Flag if Less Than 
Unsigned 
11100100101AAAAABBBBB- - - - - - ----- I.sfleu Set Flag if Less or Equal 
Than Unsigned 
11100101010AAAAABBBBB- - - - - - - ---- l.sfgts Set Flag if Greater Than 
Signed 
11100101011 AAAAABBBBB- - - - - - - - - - - l.sfges Set Flag if Greater or Equal 
Than Signed 
11100101100AAAAABBBBB- - - - - - - - - - - I.sflts Set Flag if Less Than 
Signed 
11100101101AAAAABBBBB- - - - - - - - - - - l.sfles Set Flag if Less or Equal 
Than Signed 
111100DDDDDAAAAABBBBBLLLLLLKKKKK I.cust5 Reserved for ORBIS32/64 
Custom Instructions 
111101-------------------------- l.cust6 Reserved for ORBIS32/64 
Custom Instructions 
111110-------------------------- l.cust7 Reserved for ORBIS32/64 
Custom Instructions 
111111-------------------------- l.cust8 Reserved for ORBIS32/64 
Custom Instructions 
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